The "same" language often has different terms and 🔊 sounds across the regions where it is spoken. Most dictionaries attempt to paper over these variations by positing a "standard" form, usually that spoken by a particularly powerful segment of the society. We prefer to chronicle the richness of expression within each language, without making value judgments about purity and corruption. We contend that any expression that shares a communicative message for a significant number of 👪 people can be documented and geo-tagged. We do not say that a term in Egyptian Arabic is "right" while one in Moroccan Arabic is "wrong" - we show that one is right in Egypt, the other in Morocco, and perhaps both are wrong in Iraq. With such an approach, we can tailor future language technologies to local vocabularies. This has been impossible with previous constrained approaches to dialect and geographic variation, and thus, beyond a few broad strokes (UK vs. US English, Iberian vs. Brazilian Portuguese), has never before been attempted.
These are the languages for which we have datasets that we are actively working toward putting online. Languages that are Active for you to search are marked with "A" in the list below.
Key
•A = Active language, aligned and searchable
•c = Data 🔢 elicited through the Comparative African Word List
•d = Data from independent sources that Kamusi participants align playing 🐥📊 DUCKS
•e = Data from the 🎮 games you can play on 😂🌎🤖 EmojiWorldBot
•P = Pending language, data in queue for alignment
•w = Data from 🔠🕸 WordNet teams
We are actively creating new software for you to make use of and contribute to the 🎓 knowledge we are bringing together. Learn about software that is ready for you to download or in development, and the unique data systems we are putting in place for advanced language learning and technology:
We welcome your comments and questions, and will try to respond quickly. To get in touch, please visit our contact page. You must use a real email address if you want to get a real reply!
Discussion items about language, technology, and society, from the Kamusi editor and others. This box is growing. To help develop or fund the project, please contact us!
Our biggest struggle is keeping Kamusi online and keeping it free. We cannot charge money for our services because that would block access to the very people we most want to benefit, the students and speakers of languages around the world that are almost always excluded from information technology. So, we ask, request, beseech, beg you, to please support our work by donating as generously as you can to help build and maintain this unique public resource.
Answers to general questions you might have about Kamusi services.
We are building this page around real questions from members of the Kamusi community. Send us a question that you think will help other visitors to the site, and frequently we will place the answer here.
To keep Kamusi growing as a "free" knowledge resource for the world's languages, we need major contributions from philanthropists and organizations. Do you have any connections with a generous person, corporation, foundation, or family office that might wish to make a long term impact on educational outcomes and economic opportunity for speakers of excluded languages around the world? If you can help us reach out to a potential 💛😇 GOLD Angel, please contact us!
Commentary
Discussion items about language, technology, and society, from the Kamusi editor and others. This box is growing. To help develop or fund the project, please contact us!