ACL Logo ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Google search the Anthology

Special Interest Group on Web as Corpus (SIGWAC)

To SIGWAC Home Page

» Toggle Table of Contents

2014 Proceedings of the 9th Web as Corpus Workshop (WaC-9)
2010 Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
2007 WAC3, Louvain-la-Neuve, Belgium, 15-16 September 2007
2006 Proceedings of the 2nd International Workshop on Web as Corpus
2005 WAC1, at Corpus Linguistics conference, Birmingham, UK, July 2005

2014

  1. Proceedings of the 9th Web as Corpus Workshop (WaC-9)

  2. W14-04 [bib]: Entire Volume
  3. W14-0400 [bib]: Front Matter

  4. W14-0401 [bib]: Adrien Barbaresi
    Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
  5. W14-0402 [bib]: Roland Schäfer; Adrien Barbaresi; Felix Bildhauer
    Focused Web Corpus Crawling
  6. W14-0403 [bib]: Maik Stührenberg
    Less Destructive Cleaning of Web Documents by Using Standoff Annotation
  7. W14-0404 [bib]: Magali Sanches Duran; Lucas Avanço; Sandra Aluísio; Thiago Pardo; Maria da Graça Volpe Nunes
    Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
  8. W14-0405 [bib]: Nikola Ljubešić; Filip Klubička
    {bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
  9. W14-0406 [bib]: Verena Lyding; Egon Stemle; Claudia Borghetti; Marco Brunello; Sara Castagnoli; Felice Dell'Orletta; Henrik Dittmann; Alessandro Lenci; Vito Pirrelli
    The PAISÀ Corpus of Italian Web Texts
  10. W14-0407 [bib]: Varvara Magomedova; Natalia Slioussar; Maria Kholodilova
    Internet Data in a Study of Language Change and a Program Helping to Work with Them

2010

  1. Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop

  2. W10-15 [bib]: Entire Volume
  3. W10-1500 [bib]: Front Matter

  4. W10-1501 [bib]: Emiliano Raul Guevara
    NoWaC: a large web-based corpus for Norwegian
  5. W10-1502 [bib]: Markus Dickinson; Ross Israel; Sun-Hee Lee
    Building a Korean Web Corpus for Analyzing Learner Language
  6. W10-1503 [bib]: Amit Goyal; Jagadeesh Jagaralamudi; Hal Daumé III; Suresh Venkatasubramanian
    Sketching Techniques for Large Scale NLP
  7. W10-1504 [bib]: George Dillon
    Building Webcorpora of Academic Prose with BootCaT
  8. W10-1505 [bib]: Stefan Evert
    Google Web 1T 5-Grams Made Easy (but not for the computer)

2007

  1. WAC3, Louvain-la-Neuve, Belgium, 15-16 September 2007

    To Meeting Home Page

2006

  1. Proceedings of the 2nd International Workshop on Web as Corpus

  2. W06-1700: Front Matter

  3. W06-1701: András Kornai; Péter Halácsy; Viktor Nagy; Csaba Oravecz; Viktor Trón; Dániel Varga
    Web-based frequency dictionaries for medium density languages
  4. W06-1702: Mike Cafarella; Oren Etzioni
    BE: A search engine for NLP research
  5. W06-1703: Masatsugu Tonoike; Mitsuhiro Kida; Toshihiro Takagi; Yasuhiro Sasaki; Takehito Utsuro ; S. Sato
    A comparative study on compositional translation estimation using a domain/topic-specific corpus collected from the Web
  6. W06-1704: Gemma Boleda; Stefan Bott; Rodrigo Meza; Carlos Castillo; Toni Badia; Vicente López
    CUCWeb: A Catalan corpus built from the Web
  7. W06-1705: Paul Rayson; James Walkerdine; William H. Fletcher; Adam Kilgarriff
    Annotated Web as corpus
  8. W06-1706: Arno Scharl; Albert Weichselbraun
    Web coverage of the 2004 US Presidential election
  9. W06-1707: Cédrick Fairon
    Corporator: A tool for creating RSS-based specialized corpora
  10. W06-1708: Davide Fossati; Gabriele Ghidoni; Barbara Di Eugenio; Isabel Cruz; Huiyong Xiao; Rajen Subba
    The problem of ontology alignment on the Web: A first report
  11. W06-1709: Kie Zuraw
    Using the Web as a phonological corpus: A case study from Tagalog
  12. W06-1710: Rüdiger Gleim; Alexander Mehler ; Matthias Dehmer
    Web corpus mining by instance of Wikipedia

2005

  1. WAC1, at Corpus Linguistics conference, Birmingham, UK, July 2005

    To Meeting Home Page