This is a page from the Kamusi archives. The information below may be out of date, and the links may no longer be valid. Please visit kamusi.org for current information. If you know of links or information on this page that can be updated, please let us know.
1.2 Million New Records. Good, but not good enough.
Now you can find your words with confidence across a large and growing collection of languages.
We are extremely excited to announce a big leap in the language resources we offer you from the Kamusi Project – 1.2 million new records that link over 100,000 concepts among more than 20 languages. This is a stepping-stone, not an end point. Let us explain what we've done, and where this puts us on the path to completely new possibilities in building language knowledge and advanced communication tools for you.
What's New: The data we just imported comes from the Open Multilingual Wordnet (OMW), which provides the results of the tremendous efforts of numerous people to produce equivalents in their languages for the concepts in the path-breaking Princeton Wordnet of English (PWN). What we have done that is new is:
What's Needed: Wordnet was never intended to be a definitive dictionary, for English or any other language. The data is excellent seed, but there is room for growth and improvement in many directions:
In sum, we are pleased to present you with this new and unique resource, but we cannot rest here. In 2013 we demonstrated our proof-of-concept with 100 parallel terms in 20 languages. The million+ Big Data beta we now announce shows the system can scale. As we reach toward our long term goal, every word in every language, this new data opens the gates for much more work ahead. We do not have nearly all the information you need about the terms we have already imported. We do not have fully comprehensive terms for English or any of the other languages we are making available today. And we are still missing most of the world's languages; we've now harvested generously available open, digitized data for many developed languages, but seek to gather similar knowledge for thousands of languages spoken by billions of people. With your support, we look forward to using the systems we have developed to fill in all of these gaps – and meanwhile, we invite you to explore the window into the world's languages that Kamusi now opens for you.
** We have designed methods to accommodate languages that are spoken but do not have a formal or agreed writing system. Devising methods to index the various sign languages developed by deaf communities around the world is high on our agenda.