New test for openaire2.0_data compliance for datasets
updated to dnet-openaire-data-protos:3.5.0-SNAPSHOT
cleanup, extended tests to include new relationships and mapping profiles
Tests load gthe XSLT from the TDSRule profiles in dnet-openaireplus-profiles
ticket #1588 Rename "native" compatibility to "proprietary"
added min distance algorithm, used to identify the connected components (dedup)
added more mapping tests, using xslt picked from services.openaire
testing umlauts
fixed tests, added new dedup specific jobs
added implementors for offline dedup person workflow
MapDocument implements a more general view of the pace model
added tests for author ids generation based on the datasource type
The OAI feed generates "enriched sets" for each content providers by applying a set of xpaths to records to understand if they have been enriched. The xpaths are defined in the OAI configuration profile.
added configurable max number of rel/children to be expanded in each entity
cleanup
updated to the new pace specs, cleanup
added dedup roots to csv export job, dedup index feed job, tests
added dedup configuration to the entities merging process
commenting test with big doaj dataset
Testing DOAj for #1222#note-4
reimplemented the fundingpath and context generation
updated packages
renamed test
added json size test
extended entities join configuration, added more tests
added FCT fundings as contexts
merged branch ProtoMapping
updated tests
cleanup & tests
revised tests
added serialization, tests
Refactored class that extracts fields from records. When we can't find an expected index from the configuration to check its repeatability, the field is indexed as repeatable and a counter is updated.
idScheme and idNamespace defined as part of the OAI configuration profile
Removed dependency to dnet-oai-utils to avoid inheritance of unwanted jars such as cnr-rmi-api, cnr-service-common, spring, etc., which should not appear when running a job on the cluster. Needed classes have been copied and adapted so they do not use spring anymore.
oaf schema location passed as parameter by the workflow
Testing without depending on a running mdstore
small refactor
OAI feed map only job
fixed oaf to xml serialization
merged from branch 0.0.4
fixed IIS output escaping