Project

General

Profile

1 17038 marek.hors
IIS KDM module responsible for metadata and fulltext extraction. Based on ICM Cermine library.
2
3
Contains all required classes and testing workflow.
4
5
input1:		eu.dnetlib.iis.schemas.protobuf.DocumentContentProto.DocumentContent
6
output1:	eu.dnetlib.iis.schemas.protobuf.DocumentWithBasicMetadataProto.DocumentWithBasicMetadata
7
output2:	eu.dnetlib.iis.schemas.protobuf.DocumentTextProto.DocumentText [disabled by default]
8
9
All the mappings between IIS metadata model and Cermine NLM are described on google docs site:
10
https://docs.google.com/folder/d/0BxyETgjVNSF-R0RBNDFiaGNtT3M/edit?docId=0AuafT1iWZbXddF9nUXFYLUl3bTR0dnd3anBhM3dqSmc
11
on "cermine_to_iis" section.