COBHUNI Tools
The COBHUNI tools for computational linguistics
In this section, you can find information about and links to differnet computational linguistic tools that are developed by the COBHUNI project team in the course of the ERC project until 2020.
2016:
Wiki to json export
Python program for exporting OCRed and post-corrected texts from the COBHUNI wiki and converting them into json format
by Dr. Alicia Gonzalez
Links:
https://gitlab.com/alrazi/wiki_export/
https://github.com/cobhuni/wiki_export
Initial XMI converter
Java program to convert json files from the COBHUNI Corpus into xmi format in order to prepare them for the annotation in WebAnno
by Dr. Alicia Gonzalez
Links:
https://gitlab.com/alrazi/ini_xmiconverter
https://github.com/cobhuni/ini_xmiconverter
2017:
Pepper Module for COBHUNI
Custom JSON Importer from COBHUNI Corpus into Annis
by Dr. Alicia Gonzalez
Links:
https://gitlab.com/alrazi/pepperModules-CUBHUNIModules
https://github.com/cobhuni/pepperModules-CUBHUNIModules
XMI to JSON converter
Java program to convert xmi files to JSON for data in the COBHUNI Project.
by Dr. Alicia Gonzalez
Links:
https://gitlab.com/alrazi/jsonxmihandler
https://github.com/cobhuni/jsonxmihandler
Errors fixer
Python project for fixing typos annotated in WebAnno for COBHUNI Corpus and adjusting all offset annotations.
by Dr. Alicia Gonzalez
Links:
https://gitlab.com/alrazi/errors_fixer
https://github.com/cobhuni/errors_fixer
Expand data
Creates COBHUNI Corpus with metadata, tokenized text and annotations in JSON format.
by Dr. Alicia Gonzalez
Links: