testing author dedup
branch offline dedup
Tests load the XSLT from the TDSRule profiles in dnet-openaireplus-profiles
Back to revision r39888 and updated pom and sh files
bumped minor version
make SNAPSHOTs visible to this module
added possibility to post-process the result stored in the index documents
ticket #1588 Rename "native" compatibility to "proprietary"
use of external properties
added min distance algorithm, used to identify the connected components (dedup)
bumped version
limit the job to institutional pubsrepository
counter labels
use of Text instead of ImmutableBytesWritable
reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)
reuse the same outkey and outvalue objects
added more mapping tests, using xslt picked from services.openaire
spring makes me lazy
added infospace dump mapper
added information space export job
testing umlauts
cleanup
updated to the new mongodb driver specs
Null values for FP7- and H2020-specific fields about OA mandate and Data Pilot.
Do not check the status of a record: we assume we have to insert it because the OAI store is built in refresh mode.
OAIStore with compressed bodies. Currently for beta only.
fixed tests, added new dedup specific jobs
added implementors for offline dedup person workflow