| Revision:

# Date Author Comment
58659 09/05/2020 10:42 AM Claudio Atzori

added mapping for externalreferences

58601 05/05/2020 12:43 PM Claudio Atzori

less memory pressure on the hbase table export job, context propagation utils. Proto exporter aligned with most recent dhp.model changes

58249 13/03/2020 11:03 AM Alessia Bardi

Set to invisible if there is no URL

58220 10/03/2020 04:46 PM Alessia Bardi

Added publisher field

58166 03/03/2020 03:48 PM Claudio Atzori

trying to reduce memory footprint

58163 03/03/2020 12:00 PM Claudio Atzori

fixed counter name formatting

58162 03/03/2020 10:20 AM Claudio Atzori

read only the body qualifier instead of the entire column family

58150 26/02/2020 05:56 PM Claudio Atzori

set bestaccessright.classname to 'not avalable' when the classid is UNKNOWN

58142 26/02/2020 12:03 PM Claudio Atzori

set UNKNOWN bestaccessright in case of empty accessright

58129 24/02/2020 02:27 PM Claudio Atzori

added counters for exported content (entities and relations)

58095 13/02/2020 06:32 PM Claudio Atzori

protobuf to dhp model mapping aligned with dhp-schema:1.1.5

58091 12/02/2020 05:24 PM Claudio Atzori

log the actual xslt engine

58086 12/02/2020 04:34 PM Claudio Atzori

serializing processingchargeamount and currency

58071 10/02/2020 04:06 PM Alessia Bardi

#4008: refereed field as property of the result instance and supported both from OAF and ODF.

58060 06/02/2020 02:00 PM Claudio Atzori

fixed mapping for dateofcollection and dateoftransformation

58047 03/02/2020 09:07 AM Claudio Atzori

avoid repeated colons in the dedup id creation

58039 30/01/2020 10:38 AM Claudio Atzori

use jackson object mapper to serialize dhp oaf model ignoring blank properties

58032 30/01/2020 09:59 AM Claudio Atzori

added mapping for Result.Instance

57925 20/12/2019 10:42 AM Michele Artini
57917 19/12/2019 09:54 AM Claudio Atzori

removed useless logs, they might clog the cluster resources

57838 09/12/2019 11:05 AM Michele De Bonis

addition of the trust in the mapping of the datainfo

57827 05/12/2019 05:32 PM Miriam Baglioni


57786 03/12/2019 10:05 AM Miriam Baglioni

code refactoring

57785 03/12/2019 10:04 AM Miriam Baglioni


57783 03/12/2019 09:49 AM Claudio Atzori

code formatting

57778 02/12/2019 03:18 PM Miriam Baglioni

fixed bug: mapper for organizations sent wrong information about the organization id. It was the id of the datasource that was emitted instead of the one of the organization

57774 02/12/2019 11:38 AM Alessia Bardi

Handling empty values for float fields like project totalcosts

57766 29/11/2019 12:49 PM Claudio Atzori

rehash the entire openaire id when building the root identifier to avoid clashes

57758 28/11/2019 08:12 PM Alessia Bardi

#4961: ensure we properly build XML records of projects and orgs with summary and budget information

57749 28/11/2019 03:29 PM Claudio Atzori

avoid NPEs mapping DOIBoost records

57724 27/11/2019 05:59 PM Miriam Baglioni


57723 27/11/2019 05:58 PM Miriam Baglioni

update of the propagation constant for the propagation of organization relation for projects belonging to a ds related to the org

57722 27/11/2019 05:57 PM Miriam Baglioni

propagation of relationship between products of a datasource and the organization(s) the datasource is related to. The propagation is done if the ds in an institutional repository and if the result is not already linked to the organization. No addition to the provenance of the relation is added in case it would already be present

57721 27/11/2019 05:55 PM Miriam Baglioni


57717 27/11/2019 04:24 PM Miriam Baglioni

refactor needed because the InstOrgKey class has been moved to dedicated package

57716 27/11/2019 04:09 PM Miriam Baglioni

update of the propagation constant to fit also the propagation of author affiliation

57715 27/11/2019 04:08 PM Miriam Baglioni

propagation of affiliation relation (hasAuthorInstitution) from result to organization through results linked by strong semantic relations. Results already associated to the organization will not have the hasAuthorInstitution relation overwritten

57714 27/11/2019 04:06 PM Miriam Baglioni

logic of the compositekeys for propagation moved on dedicated package. Ordering datasource, organization, results on the key that groups for the reducer

57713 27/11/2019 04:05 PM Miriam Baglioni

try to fix issue with svn

57712 27/11/2019 03:50 PM Miriam Baglioni

logic of the compositekeys for propagation moved on dedicated package. Ordering datasource, organization, results on the key that groups for the reducer

57711 27/11/2019 03:48 PM Miriam Baglioni

moved the implementation of the composite keys in a dedicated package since it is used in mode than one propagation

57709 27/11/2019 02:43 PM Claudio Atzori

better handling of side cases

57699 27/11/2019 10:03 AM Sandro La Bruzzo

fixed null abstract

57634 19/11/2019 11:24 AM Claudio Atzori

[broker] factored out method to obtain the key to be emitted by the enrichment map phase

57633 18/11/2019 10:37 AM Claudio Atzori

ORCID events are not yet ready for production

57632 18/11/2019 10:01 AM Claudio Atzori

limit the dnet-pace-core dependency to use version 3.0.14, i.e. prior to the introduction of the translation map in the configuration

57589 12/11/2019 03:28 PM Miriam Baglioni

refactoring. changed name to constant

57581 11/11/2019 06:08 PM Miriam Baglioni

update of propagation constant for orcid propagation and propagation of product to organization through semantic relation

57580 11/11/2019 06:07 PM Miriam Baglioni

mapreduce job for the propagation of ORCID through result. Follows only isSupplementedBy, isSupplementTo semantic relations

57560 11/11/2019 11:37 AM Miriam Baglioni

Added one counter to count the number of results per community

57556 08/11/2019 05:07 PM Claudio Atzori

DOIBOOST mapping: include dates formatted as \d{4}-\d{1,2}-\d{1,2}, discard records not providing at least one date

57553 08/11/2019 03:57 PM Sandro La Bruzzo

fixed valid date Actions

57544 07/11/2019 06:11 PM Claudio Atzori

aligned infospace exporter with dhp-schema:1.0.4

57533 06/11/2019 03:50 PM Claudio Atzori

fixed infospace export procedure, avoid to emit the same result more than once

57520 05/11/2019 05:26 PM Claudio Atzori

fixed infospace export procedure, avoid to emit the same result more than once

57517 05/11/2019 04:44 PM Michele Artini

OpenOrgs DB: use of tsv for rels

57513 05/11/2019 03:58 PM Claudio Atzori

fixed infospace export procedure, avoid to emit the same result more than once

57510 05/11/2019 12:19 PM Michele Artini

deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files

57509 05/11/2019 10:53 AM Claudio Atzori

context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier

57508 04/11/2019 03:10 PM Claudio Atzori

fixed export procedure: include also relationships

57507 04/11/2019 02:02 PM Claudio Atzori

added dhp mapping test

57505 04/11/2019 12:02 PM Claudio Atzori

included infospace mapping towards the new OAF DHP model

57502 04/11/2019 09:57 AM Sandro La Bruzzo

Added new mapper

57472 29/10/2019 09:23 PM Claudio Atzori

avoid to print the job configuration in the setup phase

57450 24/10/2019 12:00 PM Michele Artini

use of dnet-pace-core 3.0.15

57443 22/10/2019 02:59 PM Claudio Atzori

put operations WAS SYNC

57432 21/10/2019 04:41 PM Miriam Baglioni

Added new propagation constants for the ORCID propagation

57379 15/10/2019 05:41 PM Claudio Atzori

pick the first mergedIn identifier

57322 08/10/2019 06:04 PM Claudio Atzori

using streams instead of guava collection transformations

57270 04/10/2019 03:41 PM Enrico Ottonello

added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation

57211 01/10/2019 11:45 AM Miriam Baglioni

final logic of propagation of community through organization (products belonging to given organization will be associated to the community)

57193 30/09/2019 11:38 AM Sandro La Bruzzo

fixed test

57188 30/09/2019 11:01 AM Claudio Atzori

removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it

57186 30/09/2019 10:45 AM Sandro La Bruzzo

fixed problem on subject null in scholixToActions

56886 09/08/2019 11:50 AM Miriam Baglioni


56885 09/08/2019 11:41 AM Miriam Baglioni

fixed issue

56884 09/08/2019 10:52 AM Miriam Baglioni

moved resolution of mapping organization- communities to mapper

56878 07/08/2019 05:35 PM Miriam Baglioni

removed try catch for key validity

56876 07/08/2019 05:22 PM Claudio Atzori

using Text as grouping key type between mapper and reducer

56875 07/08/2019 04:05 PM Miriam Baglioni

fixed error

56872 07/08/2019 03:31 PM Miriam Baglioni

not needed class

56870 07/08/2019 03:28 PM Miriam Baglioni

implementation of propagation of result to community through organization

56869 07/08/2019 03:17 PM Miriam Baglioni

added new util

56868 07/08/2019 03:16 PM Miriam Baglioni

refactor and cleaning

56867 07/08/2019 03:16 PM Miriam Baglioni

refactor for the classname

56866 07/08/2019 03:15 PM Miriam Baglioni

added classname information. Save the same context just once for the update in h-base

56865 07/08/2019 03:12 PM Miriam Baglioni

added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants

56849 05/08/2019 06:34 PM Claudio Atzori

use group max size from the wf configuration

56676 22/07/2019 04:08 PM Claudio Atzori

added mapper to filter XML (index) records according to a set of criteria

56584 17/07/2019 12:06 PM Claudio Atzori

map only job that integrates the body updates before exporting them

56507 12/07/2019 12:40 PM Miriam Baglioni

fixed type issue

56491 12/07/2019 10:13 AM Miriam Baglioni

used class to extend ArrayList<String>

56442 09/07/2019 03:25 PM Miriam Baglioni

removed thrown exception when path not found in json

56440 09/07/2019 03:15 PM Miriam Baglioni

correct serialization of proto as json

56425 09/07/2019 12:03 AM Miriam Baglioni

added class that extends hashMap<String,String> not to need reflection

56394 08/07/2019 12:45 PM Miriam Baglioni

naming refactor

56384 08/07/2019 09:55 AM Miriam Baglioni

Alternative implementation w.r.t. reflection for deserializing map from json

56279 30/06/2019 09:49 AM Claudio Atzori

NPE check on publisher ugly hack

56246 27/06/2019 06:42 PM Miriam Baglioni

Adding new common methods for propagation of community to result through semantic relation

56245 27/06/2019 06:41 PM Miriam Baglioni

Update of propagation constants for propagation of community to result through semantic relation