1
|
IIS KDM module responsible for metadata and fulltext extraction. Based on ICM Cermine library.
|
2
|
|
3
|
Contains all required classes and testing workflow.
|
4
|
|
5
|
**Mafju review (2015-09-04)**: the input descriptions below are surely outdated since we don't use Protocol Buffers any more
|
6
|
|
7
|
input1: eu.dnetlib.iis.schemas.protobuf.DocumentContentProto.DocumentContent
|
8
|
output1: eu.dnetlib.iis.schemas.protobuf.DocumentWithBasicMetadataProto.DocumentWithBasicMetadata
|
9
|
output2: eu.dnetlib.iis.schemas.protobuf.DocumentTextProto.DocumentText [disabled by default]
|
10
|
|
11
|
All the mappings between IIS metadata model and Cermine NLM are described on google docs site:
|
12
|
https://docs.google.com/folder/d/0BxyETgjVNSF-R0RBNDFiaGNtT3M/edit?docId=0AuafT1iWZbXddF9nUXFYLUl3bTR0dnd3anBhM3dqSmc
|
13
|
on "cermine_to_iis" section.
|