using 'jar-with-dependencies' suffix for the uber jar
added deploy.info descriptor, using 'assembly' suffix for the uber jar
removed useless logs, they might clog the cluster resources
addition of the trust in the mapping of the datainfo
test for propagation of result to organization both via datasource and via semantic relation
refactoring
code refactoring
code formatting
exclude the assembly creation stage from the package phase, the uber jar will be uploaded on PPA
removed deploy.info. This module will be managed with a dedicated build job
include the assembly in the package phase
fixed bug: mapper for organizations sent wrong information about the organization id. It was the id of the datasource that was emitted instead of the one of the organization
Test to check the good hostedby is considered by the mapping
Handling empty values for float fields like project totalcosts
update in the generation of the master index
rehash the entire openaire id when building the root identifier to avoid clashes
#4961: ensure we properly build XML records of projects and orgs with summary and budget information
avoid NPEs mapping DOIBoost records
update of the propagation constant for the propagation of organization relation for projects belonging to a ds related to the org
propagation of relationship between products of a datasource and the organization(s) the datasource is related to. The propagation is done if the ds in an institutional repository and if the result is not already linked to the organization. No addition to the provenance of the relation is added in case it would already be present
refactor needed because the InstOrgKey class has been moved to dedicated package
update of the propagation constant to fit also the propagation of author affiliation
propagation of affiliation relation (hasAuthorInstitution) from result to organization through results linked by strong semantic relations. Results already associated to the organization will not have the hasAuthorInstitution relation overwritten
logic of the compositekeys for propagation moved on dedicated package. Ordering datasource, organization, results on the key that groups for the reducer
try to fix issue with svn
moved the implementation of the composite keys in a dedicated package since it is used in mode than one propagation
better handling of side cases
fixed null abstract
set ignore file
modification to fit with the tree-dedup
[broker] factored out method to obtain the key to be emitted by the enrichment map phase
ORCID events are not yet ready for production
limit the dnet-pace-core dependency to use version 3.0.14, i.e. prior to the introduction of the translation map in the configuration
refactoring. changed name to constant
Test for propagation of result to community through organization
Test for ORCID propagation to result through semantic relation
update of propagation constant for orcid propagation and propagation of product to organization through semantic relation
mapreduce job for the propagation of ORCID through result. Follows only isSupplementedBy, isSupplementTo semantic relations
Added one counter to count the number of results per community
DOIBOOST mapping: include dates formatted as \d{4}-\d{1,2}-\d{1,2}, discard records not providing at least one date
fixed valid date Actions
aligned infospace exporter with dhp-schema:1.0.4
create branch for tree dedup
fixed infospace export procedure, avoid to emit the same result more than once
OpenOrgs DB: use of tsv for rels
deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files
context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier
fixed export procedure: include also relationships
added dhp mapping test
included infospace mapping towards the new OAF DHP model
Added new mapper
avoid to print the job configuration in the setup phase
use of dnet-pace-core 3.0.15
put operations WAS SYNC
Added new propagation constants for the ORCID propagation
(openorgs) added schemeid to pids in sql query
pick the first mergedIn identifier
using streams instead of guava collection transformations
added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation
exclude the mapping-utils version inherited from transitive dependencies
final logic of propagation of community through organization (products belonging to given organization will be associated to the community)
Testing sygma
fixed test
removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it
fixed problem on subject null in scholixToActions
Test for checking: #4911 (missing collectedfrom/hostedby identifiers)
Testing guidelines 4 with current odf2hbase mapping (repo: Aria)
tests for qeios
fix
fixed issue
moved resolution of mapping organization- communities to mapper
removed try catch for key validity
using Text as grouping key type between mapper and reducer
fixed error
not needed class
new tests and resources
implementation of propagation of result to community through organization
added new util
refactor and cleaning
refactor for the classname
added classname information. Save the same context just once for the update in h-base
added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants
use group max size from the wf configuration
added mapper to filter XML (index) records according to a set of criteria
map only job that integrates the body updates before exporting them
fixed type issue
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.2.0
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.2.0
new test
used class to extend ArrayList<String>
updated test and added test files
removed thrown exception when path not found in json