Description The ART Corpus was developed primarily to add value to scientific papers through semantic markup that makes it easier for natural language processing and semantic web applications to automatically extract information about core scientific concepts. The corpus can also be used as a training set for machine learning algorithms, in order to automate the annotation of papers with CISP (Core Information about Scientific Papers) metadata.
Department(s) Department of Computer Science
Managed by Department of Computer Science
Readme.txt (885 bytes, text/plain)
Licence: CC BY-NC
Description_ART_Corpus.pdf (93 KB, application/pdf)
Licence: CC BY-NC
ART_Corpus.tar.gz (2323 KB, application/x-gzip)
Licence: CC BY-NC