avoid NPEs mapping DOIBoost records
refactoring
update of the propagation constant for the propagation of organization relation for projects belonging to a ds related to the org
propagation of relationship between products of a datasource and the organization(s) the datasource is related to. The propagation is done if the ds in an institutional repository and if the result is not already linked to the organization. No addition to the provenance of the relation is added in case it would already be present
refactor needed because the InstOrgKey class has been moved to dedicated package
update of the propagation constant to fit also the propagation of author affiliation
propagation of affiliation relation (hasAuthorInstitution) from result to organization through results linked by strong semantic relations. Results already associated to the organization will not have the hasAuthorInstitution relation overwritten
logic of the compositekeys for propagation moved on dedicated package. Ordering datasource, organization, results on the key that groups for the reducer
try to fix issue with svn
moved the implementation of the composite keys in a dedicated package since it is used in mode than one propagation
better handling of side cases
fixed null abstract
set ignore file
modification to fit with the tree-dedup
[broker] factored out method to obtain the key to be emitted by the enrichment map phase
ORCID events are not yet ready for production
limit the dnet-pace-core dependency to use version 3.0.14, i.e. prior to the introduction of the translation map in the configuration
refactoring. changed name to constant
Test for propagation of result to community through organization
Test for ORCID propagation to result through semantic relation
update of propagation constant for orcid propagation and propagation of product to organization through semantic relation
mapreduce job for the propagation of ORCID through result. Follows only isSupplementedBy, isSupplementTo semantic relations
Added one counter to count the number of results per community
DOIBOOST mapping: include dates formatted as \d{4}-\d{1,2}-\d{1,2}, discard records not providing at least one date
fixed valid date Actions
aligned infospace exporter with dhp-schema:1.0.4
create branch for tree dedup
fixed infospace export procedure, avoid to emit the same result more than once
OpenOrgs DB: use of tsv for rels
deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files
context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier
fixed export procedure: include also relationships
added dhp mapping test
included infospace mapping towards the new OAF DHP model
Added new mapper
avoid to print the job configuration in the setup phase
use of dnet-pace-core 3.0.15
put operations WAS SYNC
Added new propagation constants for the ORCID propagation
(openorgs) added schemeid to pids in sql query
pick the first mergedIn identifier
using streams instead of guava collection transformations
added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation
exclude the mapping-utils version inherited from transitive dependencies
final logic of propagation of community through organization (products belonging to given organization will be associated to the community)
Testing sygma
fixed test
removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it
fixed problem on subject null in scholixToActions
Test for checking: #4911 (missing collectedfrom/hostedby identifiers)
Testing guidelines 4 with current odf2hbase mapping (repo: Aria)
tests for qeios
fix
fixed issue
moved resolution of mapping organization- communities to mapper
removed try catch for key validity
using Text as grouping key type between mapper and reducer
fixed error
not needed class
new tests and resources
implementation of propagation of result to community through organization
added new util
refactor and cleaning
refactor for the classname
added classname information. Save the same context just once for the update in h-base
added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants
use group max size from the wf configuration
added mapper to filter XML (index) records according to a set of criteria
map only job that integrates the body updates before exporting them
fixed type issue
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.2.0
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.2.0
new test
used class to extend ArrayList<String>
updated test and added test files
removed thrown exception when path not found in json
correct serialization of proto as json
added class that extends hashMap<String,String> not to need reflection
naming refactor
modified test for changed implemetation of deserialization of map from json
Alternative implementation w.r.t. reflection for deserializing map from json
NPE check on publisher ugly hack
Adding new common methods for propagation of community to result through semantic relation
Update of propagation constants for propagation of community to result through semantic relation
propagation of community to result through semantic relation
print the dedup config string before parsing it
prefixes must have length = 12
modified generated json for proto structure test
added some fields to generated json
added other counters
Fixes #4362: Scielo is an Open Access Publisher
Instance from Crossref restricted by default instead of closed
RESTRICTED instead of CLOSED, fixed access mode names
Fixes #4562 (orcid format)
Added case for invalid author