Name Size Revision Age Author Comment
  tags 30419 almost 10 years Sandro La Bruzzo created tag folder for release
  trunk 33105 over 9 years Marek Horst #1017 accepting ExtractedDocumentMetadata inste...

Latest revisions

# Date Author Comment
33105 28/11/2014 06:13 PM Marek Horst

#1017 accepting ExtractedDocumentMetadata instead of DocumentText at PMC citation ingestion input. Aligning integration test and importer workflow.

33098 28/11/2014 04:27 PM Marek Horst

#1022 introducing extracted document metadata collapser at importing phase.
Propagating extracted document metadata (including PMC ingested metadata) to the processing part of the workflow, where it can be exploited by the citation matching module.
Introducing citations collapser in last stage of processing phase collapsing ingested citations with matched citations.

32943 21/11/2014 05:50 PM Marek Horst

#1017 introducing new PMC metadata ingestion, currently extracting references, journal and pages fields.
Replacing DOM/XPath based citations ingestion with much faster SAX version. Changing pmidtooaid transformer utilizing ExtractedDocumentMetadata instead of parsing XML file. Enabling PMC metadata ingestion in common/import.

32829 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datastore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32825 17/11/2014 03:43 PM Marek Horst

introducing separate citations json containing expected results, not enabled in workflow yet

32824 17/11/2014 03:42 PM Marek Horst

updating job.properties

32823 17/11/2014 03:42 PM Marek Horst

updating job.properties

32167 04/11/2014 02:04 PM Marek Horst

updating job.properties

32166 04/11/2014 02:01 PM Marek Horst

updating job.properties

32165 04/11/2014 02:01 PM Marek Horst

updating job.properties
