CoAnSys

COntent ANalysis SYStem (CoAnSys)

COntent ANalysis SYStem is a framework for mining scientific publications using Apache Hadoop. It is primarily developed by employees of the Centre for Open Science (CeON) at Interdisciplinary Centre for Mathematical and Computational Modelling (ICM), University of Warsaw (UW).

 

 

Category

Web service or application

 

Scientific areas

Digital Archives

 

Main features

keywords extraction

author disambiguation

document categorization

document similarity

citation matching

metadata extraction

logs processing

 

License

 

Supported Operating Systems

Linux

 

Supported CPU Architectures

x86-64

 

Programming languages

Java

 

Build tools

Maven

 

Test tools

JUnit