update in the generation of the master index
set ignore file
modification to fit with the tree-dedup
create branch for tree dedup
Updated DOIBoost test as they are in the trunk branch
Updated classes for DOIBoost based on the trunk version
implementation of the procedure to export native softwares on hdfsaddition of needed workflows and classes
actually incrementing the counters just added
added counters for matching subject, content providers and zenodo community for each community
test for the new implementation of zenodo community value
changed to consider to mirror the change in zenodo community value: zenodo community instead of the openaire community associated to a result. The context of the zenodo community will be removed from those of the result if the zenodo community is not associated to any openaire community.
replaced CrossRef with Crossref
depending on released parent, updated dependency dnet-openaireplus-mapping-utils to most recent release
bumped version dep dnet-openaireplus-mapping-utils:6.2.25
fixed creation of the coordinates (columnFamily) driving the Put operation
less log verbosity
fixRelations must work on main entities in a single scan pass
added support for simulation mode: allows to do not change the data and keep track of summary counters
reverted version of mapping-utils dependency
bumped version of mapping-utils dependency
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.12-MASTER
introduced jobs to fix the relationships among deduped records. Got rid of deprecations on the HBase Put method usage
refactoring and fixed issue for empty project list
Added new check for result type checking inside list of results in country propagation
fixed issue for empty result in list of results to which propagate the country (last fix produced a bug)
changed message in context
bumped version after fixing bug for list of result(50|) empty - Propagation
fixed bug for list of result(50|) empty
Propagation general Iterator and Contants class
Iterator to handle results from mapper for project propagation
Reducers fro project propagation to File and to HBase
Iterator for managing values from mapper for result country propagation
fixed issue
using most recent release of dnet-openaireplus-mapping-utils
new Exception class for not valid list of values in reducer
modified type for type variable. From int to Type
added new variables and methods
updated implementation to use iterator
specific iterator for country propagation
Added generic Iterator over the list of information gathered from the reducer
fix error on parsing abstract
updated to consider the new class Utils and the new values in Propagation Constants classes
updated to store properties of the value
class to contain the methods common to the propagation classes among the propagation types
update of the constants for the propagation
map-reduce to implement the propagation of the association between project and result through the existence of semantic relations
map to count the number of results having a semantic relation of type isSupplementedBy that also have a relation isProducedBy with at least one project
added test to verify the parsing of EPMC record
depending on dnet-openaireplus-mapping-utils:6.2.21
avoid to import non necessary affiliations from DOIBoost
depend on mappingutils 6.2.19 for better mapping of software
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.10-MASTER
Testing software from biotools
fixed bug for uncompressed abstract in DOIBoostToAction
changed mapping for compressed abstract in DOIBoostToAction
less verbose logging
excluded clashing versions of jackson. We want to keep ours
using proper import for LogFactory
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.9-MASTER
using most recent dnet-openaireplus-mapping utils version (6.2.18)
added Mapper and Reducer class for infoSpace counts workflows
added reducer to produce counts on the infospace
cannot use Guava's Splitter.splitToList, must stick to basic split method. Classpath is messed up
implemented ConfigurableExportMapper
Removed un-used import
fixed test
updated Mapper to return the whole invalid record
export invalid xml records
refactored Action
Map only job that produces [openaireId, doi] pairs of records containing invalid characters
reduced file size
added parameter to filter only organization in DOIBoostToAction
fixed problem of missing name in authors
merged beta branch to master
Change Mapper to implement DOIBoostToAction
Created DOIBoostToActions Mapping
introduced use of BlockProcessor
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.8-MASTER
updated test for hbase mapping for organizations
bumped version, dnet-openaireplus-mapping-utils:6.2.15 should fix the unmapped instancetype terms
updated dnet-openaireplus-mapping-utils dependency version
fixed issue when country information is not present for datasource
change throwing of exception with counters
using updated mapping-utils module
change parameter from ImmutableBytesWritable to Text
refactoring and change of counters
using updated mapping-utils module, added unit test to check the merge procedure for context and country updates
rollback wrong commit
fixing and testing propagation implementation
reducer for country propagation that writes on hdfs
cleanup pid types in order to make them valid attributes