updating to dnet-openaire-data-protos:3.5.0
updated to dnet-openaire-data-protos:3.5.0-SNAPSHOT
cleanup, extended tests to include new relationships and mapping profiles
counters
counter test
Back to revision r39888 and updated pom and sh files
added possibility to post-process the result stored in the index documents
use of external properties
added min distance algorithm, used to identify the connected components (dedup)
limit the job to insttitutional pubsrepository
counter labels
use of Text instead of ImmutableBytesWritable
reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)
reuse the same outkey and outvalue objects
spring makes me lazy
added infospace dump mapper
added information space export job
cleanup
updated to the new mongodb driver specs
Do not check the status of a record: we assume we have to insert it because the OAI store is built in refresh mode.
OAIStore with compressed bodies. FCurrently for beta only.
fixed tests, added new dedup specific jobs
added implementors for offline dedup person workflow