test for ARC
introducing support for projects that doesn't provide a link to a specific fundingpath.
implemented job and workflow to export the openaire identifiers
log the number of items clustered on each key
do not consider deleted entities
New test for openaire2.0_data compliance for datasets
updating to dnet-openaire-data-protos:3.5.0
updated to dnet-openaire-data-protos:3.5.0-SNAPSHOT
cleanup, extended tests to include new relationships and mapping profiles
counters
counter test
Tests load gthe XSLT from the TDSRule profiles in dnet-openaireplus-profiles
Back to revision r39888 and updated pom and sh files
added possibility to post-process the result stored in the index documents
ticket #1588 Rename "native" compatibility to "proprietary"
use of external properties
added min distance algorithm, used to identify the connected components (dedup)
limit the job to insttitutional pubsrepository
counter labels
use of Text instead of ImmutableBytesWritable
reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)
reuse the same outkey and outvalue objects
added more mapping tests, using xslt picked from services.openaire
spring makes me lazy
added infospace dump mapper
added information space export job
testing umlauts
cleanup
updated to the new mongodb driver specs
Null values for FP7 and H2020 specific fields about OA mandate and Data Pilot.
Do not check the status of a record: we assume we have to insert it because the OAI store is built in refresh mode.
OAIStore with compressed bodies. FCurrently for beta only.
fixed tests, added new dedup specific jobs
added implementors for offline dedup person workflow