Sebastian Gottwald


2008

pdf bib
Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze
Sebastian Gottwald | Matthias Richter | Gerhard Heyer | Gerik Scheuermann
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

WCTAnalyze is a tool for storing, accessing and visually analyzing huge collections of temporally indexed data. It is motivated by applications in media analysis, business intelligence etc. where higher level analysis is performed on top of linguistically and statistically processed unstructured textual data. WCTAnalyze combines fast access with economically storage behaviour and appropriates a lot of built in visualization options for result presentation in detail as well as in contrast. So it enables an efficient and effective way to explore chronological text patterns of word forms, their co-occurrence sets and co-occurrence set intersections. Digging deep into co-occurrences of the same semantic or syntactic describing wordforms, some entities can be recognized as to be temporal related, whereas other differ significantly. This behaviour motivates approaches in interactive discovering events based on co-occurrence subsets.