Wikipedia is one of the world’s largest and perhaps the biggest crowd-sourced information repository. However, it is heavily English-centric. Today, Wikipedia has more than 14 Million articles in over 270 languages. Wikipedia lists 3.36+ million pages in English, while a long tail of the bottom 200+ languages constitute a collective 2 million articles. WikiBhasha helps the process of enhancing non-English Wikipedia content, and provides an easy to use, intuitive interface to create local language content without the user ever having to leave the Wikipedia site.
WikiBhasha is multilingual content creation tool for Wikipedia and enables easy and quick translation of articles from English to more than 30 other languages that are supported by Microsoft’s Machine Translation system and Microsoft’s Collaborative Translation Framework. WikiBhasha is much more than a translator and also allows users to create new articles from scratch. The browser based application features a UI layer that stays on the target language Wikipedia for the entire content creation process. It enables Wikipedia users and contributors to explore and source content from English Wikipedia articles, translate the content into a set of target languages, and use the content with user additions and corrections for contribution to the target language Wikipedia.
WikiBhashaâ€”Wiki,signifying its community-oriented approach; Bhasha,a Sanskrit word meaning languageâ€”was developed by Microsoft Research India. It started as a research prototype with a text-based interface. The team behind WikiBhasha is led by A Kumaran, Research Manager in the Multilingual Systems Research group at Microsoft Research India.
WikiBhasha might open the doors to a whole new world of content translation into languages that machine translators now ignore. The goal is to enhance the Wikipedia content, as well as increasing availability of multilingual content.