update in the generation of the master index
set ignore file
modification to fit with the tree-dedup
create branch for tree dedup
fixed infospace export procedure, avoid to emit the same result more than once
OpenOrgs DB: use of tsv for rels
deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files
context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier
fixed export procedure: include also relationships
added dhp mapping test
included infospace mapping towards the new OAF DHP model
Added new mapper
avoid to print the job configuration in the setup phase
use of dnet-pace-core 3.0.15
put operations WAS SYNC
Added new propagation constants for the ORCID propagation
(openorgs) added schemeid to pids in sql query
pick the first mergedIn identifier
using streams instead of guava collection transformations
added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation
exclude the mapping-utils version inherited from transitive dependencies
final logic of propagation of community through organization (products belonging to given organization will be associated to the community)
Testing sygma
fixed test
removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it
fixed problem on subject null in scholixToActions
Test for checking: #4911 (missing collectedfrom/hostedby identifiers)
Testing guidelines 4 with current odf2hbase mapping (repo: Aria)
tests for qeios
fix
fixed issue
moved resolution of mapping organization- communities to mapper
removed try catch for key validity
using Text as grouping key type between mapper and reducer
fixed error
not needed class
new tests and resources
implementation of propagation of result to community through organization
added new util
refactor and cleaning
refactor for the classname
added classname information. Save the same context just once for the update in h-base
added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants
use group max size from the wf configuration
added mapper to filter XML (index) records according to a set of criteria
map only job that integrates the body updates before exporting them
fixed type issue
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.2.0
new test
used class to extend ArrayList<String>
updated test and added test files
removed thrown exception when path not found in json
correct serialization of proto as json
added class that extends hashMap<String,String> not to need reflection
naming refactor
modified test for changed implemetation of deserialization of map from json
Alternative implementation w.r.t. reflection for deserializing map from json
NPE check on publisher ugly hack
Adding new common methods for propagation of community to result through semantic relation
Update of propagation constants for propagation of community to result through semantic relation
propagation of community to result through semantic relation
print the dedup config string before parsing it
prefixes must have length = 12
modified generated json for proto structure test
added some fields to generated json
added other counters
Fixes #4362: Scielo is an Open Access Publisher
Instance from Crossref restricted by default instead of closed
RESTRICTED instead of CLOSED, fixed access mode names
Fixes #4562 (orcid format)
Added case for invalid author
Another test publisher
More cases to discard a record for test authors
Fix #4637 and improve check for invalid authors
software link export job
added M/R to export publication-software links
added openaire id to publication
test for constraints on products from datasource
minor (constant has been renamed)
Starting the implementation for propagation of result to community. Result linked by isSupplementedBy to result associatied to the community is linked to the community
update for propagation community-result
added logic for selection criteria implementation on datasources
test for exporting link between publications and softwares (related to #4593)
export of link between publication and software (related to #4593)
added test to validate the informations added to protobuffer
print json but commented out.Added test to get a proto from a json
added resourcetype and resulttype, new work type mapping
mapping orcid work type <--> dnet:publication_resource dnet vocabulary
Depending on released parent
Reomved ignore: Miriam fixed the input in the prev commit
removed improper selcriteria