To be held in association with WWW2012 in Lyon, France, 17th April 2012
Sponsored by ACL SIGWAC, http://www.sigwac.org.uk
More and more people are using Web data for linguistic and NLP research: the Web provides an easy
source of linguistic data in a great variety of languages. However, a ‘crawl’ is not ready for exploration
in the same way a traditional ‘corpus’ is. We need to turn a crawl into a corpus. The workshop, the seventh
in an annual series, provides a venue for exploring what it involves, how to do it, and what we find out if we do.