Table of contents

Anthology Volume Year Papers
W14-04 Proceedings of the 9th Web as Corpus Workshop (WaC-9) 2014 7
W10-15 Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop 2010 6
W06-17 Proceedings of the 2nd International Workshop on Web as Corpus 2006 11


Pdf Export Search Proceedings of the 9th Web as Corpus Workshop (WaC-9)


Pdf Export Search Proceedings of the 9th Web as Corpus Workshop (WaC-9)
[W14-0400]: Felix Bildhauer | Roland Schäfer

Pdf Export Search Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
[W14-0401]: Adrien Barbaresi

Pdf Export Search Focused Web Corpus Crawling
[W14-0402]: Roland Schäfer | Adrien Barbaresi | Felix Bildhauer

Pdf Export Search Less Destructive Cleaning of Web Documents by Using Standoff Annotation
[W14-0403]: Maik Stührenberg

Pdf Export Search Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
[W14-0404]: Magali Sanches Duran | Lucas Avanço | Sandra Aluísio | Thiago Pardo | Maria da Graça Volpe Nunes

Pdf Export Search {bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
[W14-0405]: Nikola Ljubešić | Filip Klubička

Pdf Export Search The PAISÀ Corpus of Italian Web Texts
[W14-0406]: Verena Lyding | Egon Stemle | Claudia Borghetti | Marco Brunello | Sara Castagnoli | Felice Dell'Orletta | Henrik Dittmann | Alessandro Lenci | Vito Pirrelli



Pdf Export Search Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop


Pdf Export Search Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
[W10-1500]: Adam Kilgarriff | Dekang Lin

Pdf Export Search NoWaC: a large web-based corpus for Norwegian
[W10-1501]: Emiliano Raul Guevara

Pdf Export Search Building a Korean Web Corpus for Analyzing Learner Language
[W10-1502]: Markus Dickinson | Ross Israel | Sun-Hee Lee

Pdf Export Search Sketching Techniques for Large Scale NLP
[W10-1503]: Amit Goyal | Jagadeesh Jagaralamudi | Hal Daumé III | Suresh Venkatasubramanian

Pdf Export Search Building Webcorpora of Academic Prose with BootCaT
[W10-1504]: George Dillon

Pdf Export Search Google Web 1T 5-Grams Made Easy (but not for the computer)
[W10-1505]: Stefan Evert



Pdf Export Search Proceedings of the 2nd International Workshop on Web as Corpus


Pdf Export Search Proceedings of the 2nd International Workshop on Web as Corpus
[W06-1700]:

Pdf Export Search Web-based frequency dictionaries for medium density languages
[W06-1701]: András Kornai | Péter Halácsy | Viktor Nagy | Csaba Oravecz | Viktor Trón | Dániel Varga

Pdf Export Search BE: A search engine for NLP research
[W06-1702]: Mike Cafarella | Oren Etzioni

Pdf Export Search A comparative study on compositional translation estimation using a domain/topic-specific corpus collected from the Web
[W06-1703]: Masatsugu Tonoike | Mitsuhiro Kida | Toshihiro Takagi | Yasuhiro Sasaki | Takehito Utsuro | S. Sato

Pdf Export Search CUCWeb: A Catalan corpus built from the Web
[W06-1704]: Gemma Boleda | Stefan Bott | Rodrigo Meza | Carlos Castillo | Toni Badia | Vicente López

Pdf Export Search Annotated Web as corpus
[W06-1705]: Paul Rayson | James Walkerdine | William H. Fletcher | Adam Kilgarriff

Pdf Export Search Web coverage of the 2004 US Presidential election
[W06-1706]: Arno Scharl | Albert Weichselbraun

Pdf Export Search Corporator: A tool for creating RSS-based specialized corpora
[W06-1707]: Cédrick Fairon

Pdf Export Search The problem of ontology alignment on the Web: A first report
[W06-1708]: Davide Fossati | Gabriele Ghidoni | Barbara Di Eugenio | Isabel Cruz | Huiyong Xiao | Rajen Subba

Pdf Export Search Using the Web as a phonological corpus: A case study from Tagalog
[W06-1709]: Kie Zuraw

Pdf Export Search Web corpus mining by instance of Wikipedia
[W06-1710]: Rüdiger Gleim | Alexander Mehler | Matthias Dehmer