dnet45dhp-schemasdnet-hadoopdnet40dnet50
#1395 WorkflowRuntimeParameters static fields cleanup, moving parameters to dedicated modules to prevent excessing icm-iis-common module modifications
#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.
#1248 persisting content url in supplementaryData to make it easier to find content causing failure
#1248 bugfix renaming inputEntityId to inputObjectId after schema changes
#1248 introducing failures sink datastore support in metadata extraction module
Removing usage of working_dir from Java workflow node.
#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT
#913 reading content size from newly introduced DocumentContentUrl#contentSize field not from URLConnection where size is not available when Transfer-Encoding=chunked, setting to 0 when size is not available
#913 introducing support for max file size parameter, currently checked against Content-Lenght header
affiliation's address and country code passed from Cermine to Avro
View revisions
Also available in: Atom