Release of BabelNet 4.0

Event Notification Type: 
Other
Abbreviated Title: 
Location: 
State: 
Country: 
Contact Email: 
City: 
Contact: 
Roberto Navigli

=============================

RELEASE OF BABELNET 4.0

http://babelnet.org

=============================

We are proud to announce the release of a new major version of BabelNet and its API, developed jointly by the Linguistic Computing Laboratory of the Sapienza University of Rome under the supervision of prof. Roberto Navigli, and Babelscape, a Sapienza startup company providing innovative solutions for multilingual NLP. BabelNet -- winner of the prominent paper award 2017 from the Artificial Intelligence Journal and the META prize 2015, and covered in media such as The Guardian and Time magazine -- is today’s most far-reaching multilingual resource which, according to need, can be used as an encyclopedic dictionary, or a semantic network or a huge knowledge base. BabelNet was created by means of the seamless interlinking and integration of the largest multilingual Web encyclopedia - i.e., Wikipedia - with the most popular computational lexicon of English - i.e., WordNet, and other lexical semantic resources such as Wiktionary, OmegaWiki, Wikidata, Wikipedia infoboxes, dozens of wordnets, Wikiquote, FrameNet, VerbNet, Microsoft Terminology, GeoNames, and ImageNet. BabelNet provides multilingual synsets, i.e., concepts and named entities lexicalized in many languages and connected with large amounts of semantic relations.

Version 4.0 comes with the following features:

- 284 languages now covered
- Wikipedia, Wiktionary, Wikidata and OmegaWiki have been updated thanks to BabelNet live, a continuously-growing resource with daily updates from all the sources that go to make it up
- Better sense inventory thanks to the manual validation of thousands of mappings
- All existing wordnets updated
- New wordnets integrated for Gaelic, Portuguese and Korean
- Improved treatment of Chinese
- 2 million new multilingual synsets (from 14 in v3.7 to 16 million synsets in v4)
- 832 million senses (was 745 million Babel senses in v3.7, increasing language coverage considerably)
- Improved management of open wordnets that are now stored with their individual licenses
- Improved version of the Java and HTTP RESTful API (http://babelnet.org/download). The Java API comes with reengineered interfaces and classes, additional methods for Java 8 and a Java 9-ready packaging, support of the latest version of Lucene. Universal POS tags are now adopted, paving the way to synsets for closed-class words. A brand-new Python API is under development with the same interface as the Java API.

More statistics are available at: http://babelnet.org/stats.

We are organizing a two-day summer school and hackathon, with tutorials, interactive sessions and presentations targeting computational linguists, computer scientists, linguists and, more in general, BabelNet fans. We are gathering interest and preferences: if you are potentially interested, just fill in the form (https://goo.gl/forms/vbHXLmiwQ6RQtR433)! The workshop will be held either in Rome or Venice (to be decided: vote for it!).

We are looking for native speakers of German, Dutch and Chinese for linguistic annotation tasks. If you are interested, just contact us!

Kind regards,
The BabelNet team