Logo Utrecht University

UiL OTS Labs

How-tos

Accessing the CGN (Corpus Gesproken Nederlands)

Last updated on 22 November 2013 by Martijn van der Klis

If you have suggestions on how to improve this document, or find mistakes, please send them to labman.gw@nulluu.nl

As an UiL OTS labuser you can access the CGN (Corpus Gesproken Nederlands) under Windows and Linux on all lab PC’s (and from home).

This can be done by accessing the corpora share and open the CGNv2 folder. To run the corpus exploitation software (COREX) one needs to run either corex_start_for_linux.sh or corex_start_for_windows.bat. Please note that starting corex might take up to 10 seconds!

Accessing the corpora share can be done via:

  • In the lab (for instance K06), under Windows OS: the shared Lab-Drives should have the letter L and should be visible in file explorer. Here you should be able to find the shared folder corpora which contains the CGNv2 folder.
  • In the lab (for instance K06), under Linux OS: the Lab-Drives mount should be available as a link on your desktop.
  • From outside the lab: we do not recommend connecting to any corpora from outside the lab. Connection, speed and software problems are serious obstacles.

If you expect to be searching the corpus very intensely for a longer period we recommend asking us to create a local copy for you. This is beneficial because running large searches via the network is substantially slower then using a local copy.