replacing non-standard dash character with '-'
fixing test run on Jenkins: setting encoding explicitly to UTF-8
#1017 introducing new PMC metadata ingestion, currently extracting references, journal and pages fields. Replacing DOM/XPath based citations ingestion with much faster SAX version. Changing pmidtooaid transformer to utilize ExtractedDocumentMetadata instead of parsing XML file. Enabling PMC metadata ingestion in common/import.
#757 introducing article type extraction along with unit test. Article type will be required for filtering out PMC duplicates and leaving only proper types.
#757 fixing PMID and DOI matching, fixing sourceDocumentId and destinationDocumentId generation
Commented out test in a stub of a solution to the task #576: Ingestion of metadata from EuropePMC.
Stub of a solution to the task #576: Ingestion of metadata from EuropePMC.