fixed tests, added new dedup specific jobs
added implementors for offline dedup person workflow
updated sample project records
added tests for author ids generation based on the datasource type
Updating tests: funding path ids include funder shortnames (#1379)
The OAI feed generates "enriched sets" for each content providers by applying a set of xpaths to records to understand if they have been enriched. The xpaths are defined in the OAI configuration profile.
fetch only instancetype and hostedby from the instance attributes, adding url to external references
added configurable max number of rel/children to be expanded in each entity
Added dc:creator
WT ids are uniform now
added dedup roots to csv export job, dedup index feed job, tests
Testing DOAj for #1222#note-4
fundingtree is an escaped xml, not a json anymore.
sample records
reimplemented the fundingpath and context generation
Updated configuration for testing
extended entities join configuration, added more tests
test record took from HDFS
added FCT fundings as contexts
Added oaf:identifiers to record sample.
updated tests
cleanup & tests
added more fields in test record
revised tests
idScheme and idNamespace defined as part of the OAI configuration profile
Removed dependency to dnet-oai-utils to avoid inheritance of unwanted jars such as cnr-rmi-api, cnr-service-common, spring, etc., which should not appear when running a job on the cluster. Needed classes have been copied and adapted so they do not use spring anymore.
small refactor
OAI feed map only job