renaming test resources to be compliant with windows file system naming requirements: replacing '|' with '_'
renaming test resources to be compliant with windows file system naming requirements
fixing destination id in expected citation record
updating fundingtree value to xml representation and changing expected fundingclass as outcome
#1498 introducing major citations related refactoring including new generic direct citation matching moved to processing phase, introduced position field in all citations schemas and updated collapser taking position into account when merging citations details coming from 3 variuos sources: fuzzy citationmatching, direct citationmatching, references metadata
#1212 updating classification test expected results after fixing typo: dccclasses->ddcclasses in taxonomies.db
updating expected classes, setting acm classes
updating expected classes
introducing missing pdb reference extraction missing parameters
#1315 providing missing confidenceLevel
#1315 updating expected jsons in integration test after DocumentToConceptIds schema refactoring
upgrading xmlns version to 0.4 in order to support global element
#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters
Removing usage of working_dir from Java workflow node.
reenabling document to project reference import validation
updating expected documents list
temporarily skipping docproject validation
#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072
fixing json escape character by putting \\ in place of \
extending mapreduce metadata importer test with validating import of different kind of relations and dataset identifier
removing obsolete citations
updating confidence level value to 1.0 for record coming from PMC
removing obsolete pdf directory
adding missing "confidenceLevel" field
maintaining pmc citation and testing citations merging process
reintroducing multiple citations after introducing sorting in transformer
limiting citations count to 1 until results order produced by citation matching module is repetitive
including: FET project reference extraction, EGI case, dataset reference extraction outcome validation
enabling citation matching algorithm
updating expected citations
removing comment
primary processing integration test major refactoring: dropping cermine execution and providing plaintext and extracted metadata as json records
overriding memory parameter due to test cluster memory limitations, setting it to Xmx256m
overriding memory parameter due to test cluster memory limitations, setting it to Xmx128m
overriding memory parameter due to test cluster memory limitations, setting it to Xmx512m
updating expected classes in integration test after recent #720 change and fixing confidence level distribution
overriding memory parameter due to test cluster memory limitations
#1133 dropping useless workfing_dir creation for java nodes
#919 renaming DocumentToResearchInitiative to DocumentToConceptId and DocumentToResearchInitiatives to DocumentToConceptIds
#919 adding missing i/o ports related to FET projects reference extraction
introducing separate citations json containing expected results, not enabled in workflow yet
fixing citations schema type
enabling document classification and reserach initiatives reference extraction algorithms
introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values
#840 moving IdentifierMapping from importer to common package
#840 renaming DeduplicationMapping to more generic IdentifierMapping
disabling workflow tests
removing extracted_metadata.json which will not be checked anymore
skipping extracted_metadata comparison which is cumbersome due to frequent changes and large volume of references
introducing newly added address field in json record
fixing field names after recent Affiliation.avdl refactoring and adding countryCode field, renaming contry to countryName
updating expected output
updating performance test
moving ACM importer to icm-iis-mainworkflows due to extending dependances with cermine, introducing performance tests
updating expected record content
updating expected extracted metadata
Fixing names of parameters accepted by workflow nodes
#486 fixing integration test: introducing missing document_text_wos input port for primary/processing
updating expected references output for doc=id-3
fixing affiliations and positions in authors details
fixing HBase model json representation to be compliant with most recent dnet-openaire-data-protos:3.0.0-SNAPSHOT model: complex relation identifiers, dataInfo on fields level etc
introducing additional logging