ACL Anthology
Workshop on Web as Corpus (2014)
Volumes
Proceedings of the 9th Web as Corpus Workshop (WaC-9)
7 papers
Proceedings of the 9th Web as Corpus Workshop (WaC-9)
Felix Bildhauer | Roland Schäfer
Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
Adrien Barbaresi
Focused Web Corpus Crawling
Roland Schäfer | Adrien Barbaresi | Felix Bildhauer
Less Destructive Cleaning of Web Documents by Using Standoff Annotation
Maik Stührenberg
Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
Magali Sanches Duran | Lucas Avanço | Sandra Aluísio | Thiago Pardo | Maria da Graça Volpe Nunes
{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
Nikola Ljubešić | Filip Klubička
The PAISÀ Corpus of Italian Web Texts
Verena Lyding | Egon Stemle | Claudia Borghetti | Marco Brunello | Sara Castagnoli | Felice Dell’Orletta | Henrik Dittmann | Alessandro Lenci | Vito Pirrelli