CERMINE

Content ExtRactor and MINEr (CERMINE)

CERMINE is a Java library and a web service for extracting metadata and content from PDF files containing academic publications. CERMINE is written in Java at Centre for Open Science at Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw.

 

 

Category

Web service or application

 

Scientific areas

Digital Library

 

Main features

metadata extraction

parsed bibliographic reference extarction

content extraction

 

License

GNU Affero General Public License 3.0 (AGPL-3.0)

 

Supported Operating Systems

Linux

 

Supported CPU Architectures

x86-64

 

Programming languages

Java

 

Build tools

Maven

 

Test tools

JUnit