Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
document.json 0 Bytes 30802 almost 10 years Michal Oniszczuk Stub of a solution to the task #576: Ingestion ...
document.xml 149 KB 32942 over 9 years Marek Horst #1017 introducing new PMC metadata ingestion cu...
document_gapped.xml 149 KB 30802 almost 10 years Michal Oniszczuk Stub of a solution to the task #576: Ingestion ...
document_with_affiliations.xml 2.65 KB 37344 about 9 years Marek Horst #1329 adding affiliations field in ExtractedDoc...
od_______908__0451fa1ded79a63729296731e53335c0.xml 137 KB 39086 almost 9 years Marek Horst renaming test resources to be compliant with wi...
od_______908__0452195ccf851072fd097fc49bfbb9da.xml 159 KB 39086 almost 9 years Marek Horst renaming test resources to be compliant with wi...
od_______908__365a50343d53774f68fa13800349d372.xml 5.39 MB 39086 almost 9 years Marek Horst renaming test resources to be compliant with wi...
pretty.xml 206 KB 32942 over 9 years Marek Horst #1017 introducing new PMC metadata ingestion cu...
single-ref-document.xml 35.7 KB 33131 over 9 years Marek Horst replacing non standard dash character to '-'

Latest revisions

# Date Author Comment
39086 08/09/2015 01:43 PM Marek Horst

renaming test resources to be compliant with windows file system naming requirements

37344 20/05/2015 06:49 PM Marek Horst

#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.

33131 02/12/2014 12:48 PM Marek Horst

replacing non standard dash character to '-'

32942 21/11/2014 05:50 PM Marek Horst

#1017 introducing new PMC metadata ingestion currently extracing references, journal and pages fields.
Replacing DOM/XPath based citations ingestion with much faster SAX version. Changing pmidtooaid transformer utilizing ExtractedDocumentMetadata instead of parsing XML file. Enabling PMC metadata ingestion in common/import.

30802 20/09/2014 02:19 PM Michal Oniszczuk

Stub of a solution to the task #576: Ingestion of metadata from EuropePMC.

View revisions

Also available in: Atom