IIS KDM module responsible for metadata and full-text extraction. Based on the ICM Cermine library.
    
Contains all required classes and a testing workflow.
    
**Mafju review (2015-09-04)**: the input descriptions below are surely outdated, since we no longer use Protocol Buffers.
    
input1:  eu.dnetlib.iis.schemas.protobuf.DocumentContentProto.DocumentContent
output1: eu.dnetlib.iis.schemas.protobuf.DocumentWithBasicMetadataProto.DocumentWithBasicMetadata
output2: eu.dnetlib.iis.schemas.protobuf.DocumentTextProto.DocumentText [disabled by default]
    
All mappings between the IIS metadata model and Cermine NLM are described in the "cermine_to_iis" section of the Google Docs document at:
https://docs.google.com/folder/d/0BxyETgjVNSF-R0RBNDFiaGNtT3M/edit?docId=0AuafT1iWZbXddF9nUXFYLUl3bTR0dnd3anBhM3dqSmc