Kamusi Labs is the development wing of the Kamusi Project. We are doing research on many tracks, developing language technologies that nobody asked for, nobody knows they need, with unknown market demand, that nobody is paying to bring into existence. Nevertheless, we are going ahead with the work. Students join virtually from around the world for internships or semester projects, and hackers volunteer for cool ways to put their coding skills to use for the greater good. Their projects are intended to solve a range of language technology problems that have previously been too pernicious, but become realizable as a result of our unique molecular data structure.
Please consult our internal whiteboard to view our list of active and upcoming projects.
We are passionately committed to equal opportunity. Kamusi Labs seeks to nurture the talent of our members regardless of race, religion, nationality, gender, or sexual orientation - we focus on your skills and enthusiasm. Young women and men have worked on coding and language projects with us from Burundi, Canada, China, France, Germany, Ghana, India, Iran, Italy, Kenya, Korea, Lebanon, Nigeria, Pakistan, Portugal, Rwanda, South Africa, Spain, Sweden, Switzerland, Tanzania, Uganda, the UK, and the US. Openness to working with a diverse group, and devotion for solving problems to open tomorrow's digital opportunities for speakers of marginalized languages, are major requirements for joining the team.
We have a long set of coding and linguistic data tasks that will result in a range of tools to use language in ways that have never before been possible. Many of our technical tasks can be implemented immediately by talented coders. We also welcome people interested in projects for particular languages, ranging from field lexicography to data mining to building models for natural language processing. We welcome volunteer hackers, or students seeking exciting research projects, to contact us to discuss how you can jump in.
Our projects span a range of interests:
These are the languages for which we have datasets that we are actively working toward putting online. Languages that are Active for you to search are marked with "A" in the list below.
•A = Active language, aligned and searchable
•c = Data 🔢 elicited through the Comparative African Word List
•d = Data from independent sources that Kamusi participants align playing 🐥📊 DUCKS
•e = Data from the 🎮 games you can play on 😂🌎🤖 EmojiWorldBot
•P = Pending language, data in queue for alignment
•w = Data from 🔠🕸 WordNet teams
We are actively creating new software for you to make use of and contribute to the 🎓 knowledge we are bringing together. Learn about software that is ready for you to download or in development, and the unique data systems we are putting in place for advanced language learning and technology:
Our biggest struggle is keeping Kamusi online and keeping it free. We cannot charge money for our services because that would block access to the very people we most want to benefit, the students and speakers of languages around the world that are almost always excluded from information technology. So, we ask, request, beseech, beg you, to please support our work by donating as generously as you can to help build and maintain this unique public resource.
Answers to general questions you might have about Kamusi services.
We are building this page around real questions from members of the Kamusi community. Send us a question that you think will help other visitors to the site, and frequently we will place the answer here.