Removed un-used import
fixed test
updated Mapper to return the whole invalid record
export invalid xml records
refactored Action
Map only job that produces [openaireId, doi] pairs of records containing invalid characters
reduced file size
added parameter to filter only organization in DOIBoostToAction
fixed problem of missing name in authors
merged beta branch to master
Change Mapper to implement DOIBoostToAction
Created DOIBoostToActions Mapping
introduced use of BlockProcessor
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.1.8-MASTER
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.8-MASTER
updated test for hbase mapping for organizations
bumped version, dnet-openaireplus-mapping-utils:6.2.15 should fix the unmapped instancetype terms
updated dnet-openaireplus-mapping-utils dependency version
fixed issue when country information is not present for datasource
change throwing of exception with counters
using updated mapping-utils module
change parameter from ImmutableBytesWritable to Text
refactoring and change of counters
using updated mapping-utils module, added unit test to check the merge procedure for context and country updates
rollback wrong commit
fixing and testing propagation implementation
reducer for country propagation that writes on hdfs
cleanup pid types in order to make them valid attributes
Need to set resourceTypeGeneral to clinicalTrial as this is where the IIS can distinguish clinical trial records from "normal dataset"
added code for propagation of countries from institutional organization
why parse strings as Floats?
subject terms exporter
[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.1.6-MASTER
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.6-MASTER
updated pom, master branch
master branch for deployments @ICM
reverted to r52985 . Test runs shows we need to rely on the edgeIds produced by the connected components identfication phase instead of the vertexIds
changes to consider modification in code to align to trunck version
alignment to trunk version
added default value for trust as "0.8"
remove reducer for bulktaggin
propagation general classes
propagation for inst repos modev in dedicated folder
modification for using trust as parameter of the configuration of the hodoop job and change in the provenance
avoid to produce duplicated events by eliminating the roots from the comparison process
broker event serialization test for ProjectEventFactory
introduced mapping resulttype -> portal url
refactor to consider the change in provenance for bulktagging operations
fixing issues on new param
enabling/disabling of writing operation
[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.1.5-BETA
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.5-BETA
Fixed log class name
cleaned up unused method, using setDurability in put operation
avoid collisions when hashing pids by value
change the name for the Key in country propagation from institutional repositories
DedupBuildRoots[mapper|reducer] merged implementation from beta branch
cleaned up Tika Lib
added charset discovery mechanism for Grid.AC
country propagation. Update to consider trust of the datasource as trust of the propagation
renamed file used in test
integrated GridAcImport job (mapper/reducer) stuff from beta branch
updated opentrial input record used in test, fixes #3886
added mapper and hadoop job configuration file for importing Grid.AC organization data
allow to configure datasource typologies to be considered in the propagation process
using mapping-utils version 6.2.11
integrating bulktag from trunk to beta branch
jason!
rule out invalid dates also on CrossRefToActions
rule out invalid dates on ScholixToActions
changed verification for country element
cleanup, move forward with PropagationCountryInstitutionalOrganizationMapper & Reducer implementation
mapper and reducer for propagation of country for institutional repositories
propagation for country (organizational institution)
propagation country inst repo mapper and reducer
second sorting
Second sorting
cleanup, move forward with PropagationCountryInstitutionalOrganizationMapper implementation
propagation reducer
propagation
cleanup
produce 'supplement' subrel type in case of supplement relationships
simplified connected component application on the graph
updated dependencies: dnet-pace-core
adding check to understand the bug of wrong relation generated
do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer
do not push vertex ids in memory, process them on the fly
fixed name of TDS profile used by test method
added jobs for predatory journal analysis
removed warning
added invisible setup