
# Date Author Comment
57749 28/11/2019 03:29 PM Claudio Atzori

avoid NPEs mapping DOIBoost records

57724 27/11/2019 05:59 PM Miriam Baglioni

refactoring

57723 27/11/2019 05:58 PM Miriam Baglioni

update of the propagation constant for the propagation of organization relation for projects belonging to a ds related to the org

57722 27/11/2019 05:57 PM Miriam Baglioni

propagation of relationship between products of a datasource and the organization(s) the datasource is related to. The propagation is done if the ds is an institutional repository and if the result is not already linked to the organization. Nothing is added to the provenance of the relation if it is already present

57721 27/11/2019 05:55 PM Miriam Baglioni

refactoring

57717 27/11/2019 04:24 PM Miriam Baglioni

refactor needed because the InstOrgKey class has been moved to a dedicated package

57716 27/11/2019 04:09 PM Miriam Baglioni

update of the propagation constant to fit also the propagation of author affiliation

57715 27/11/2019 04:08 PM Miriam Baglioni

propagation of affiliation relation (hasAuthorInstitution) from result to organization through results linked by strong semantic relations. Results already associated to the organization will not have the hasAuthorInstitution relation overwritten

57714 27/11/2019 04:06 PM Miriam Baglioni

logic of the composite keys for propagation moved to a dedicated package. Ordering datasource, organization, results on the key that groups for the reducer

57713 27/11/2019 04:05 PM Miriam Baglioni

try to fix issue with svn

57712 27/11/2019 03:50 PM Miriam Baglioni

logic of the composite keys for propagation moved to a dedicated package. Ordering datasource, organization, results on the key that groups for the reducer

57711 27/11/2019 03:48 PM Miriam Baglioni

moved the implementation of the composite keys to a dedicated package since it is used in more than one propagation

57709 27/11/2019 02:43 PM Claudio Atzori

better handling of side cases

57699 27/11/2019 10:03 AM Sandro La Bruzzo

fixed null abstract

57656 21/11/2019 10:00 AM Michele De Bonis

set ignore file

57655 21/11/2019 09:58 AM Michele De Bonis

modification to fit with the tree-dedup

57634 19/11/2019 11:24 AM Claudio Atzori

[broker] factored out method to obtain the key to be emitted by the enrichment map phase

57633 18/11/2019 10:37 AM Claudio Atzori

ORCID events are not yet ready for production

57632 18/11/2019 10:01 AM Claudio Atzori

limit the dnet-pace-core dependency to use version 3.0.14, i.e. prior to the introduction of the translation map in the configuration

57589 12/11/2019 03:28 PM Miriam Baglioni

refactoring. changed name to constant

57583 11/11/2019 06:22 PM Miriam Baglioni

Test for propagation of result to community through organization

57582 11/11/2019 06:21 PM Miriam Baglioni

Test for ORCID propagation to result through semantic relation

57581 11/11/2019 06:08 PM Miriam Baglioni

update of propagation constant for orcid propagation and propagation of product to organization through semantic relation

57580 11/11/2019 06:07 PM Miriam Baglioni

mapreduce job for the propagation of ORCID through result. Follows only isSupplementedBy, isSupplementTo semantic relations

57560 11/11/2019 11:37 AM Miriam Baglioni

Added one counter to count the number of results per community

57556 08/11/2019 05:07 PM Claudio Atzori

DOIBOOST mapping: include dates formatted as \d{4}-\d{1,2}-\d{1,2}, discard records not providing at least one date
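
The date rule in r57556 can be illustrated with a minimal sketch. It is not the project's mapping code: the record layout and the names `valid_dates` and `keep_record` are assumptions made up for this example, and only the pattern itself is taken from the commit message.

```python
import re

# Pattern quoted in the commit message: four-digit year,
# then one- or two-digit month and day.
DATE_PATTERN = re.compile(r"\d{4}-\d{1,2}-\d{1,2}")


def valid_dates(record):
    """Return the entries of the (hypothetical) 'dates' field that fully match the pattern."""
    return [d for d in record.get("dates", []) if d and DATE_PATTERN.fullmatch(d)]


def keep_record(record):
    """Keep a record only if it provides at least one well-formed date."""
    return len(valid_dates(record)) > 0


# Illustrative input, not real DOIBoost data.
records = [
    {"id": "r1", "dates": ["2019-11-08"]},
    {"id": "r2", "dates": ["2019-3-7", "n/a"]},
    {"id": "r3", "dates": []},
    {"id": "r4"},
]
kept = [r["id"] for r in records if keep_record(r)]
print(kept)
```

Note that the pattern only checks the shape of the string, so a value such as 2019-13-45 would still pass; whether the actual mapping validates calendar dates further is not stated in the log.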

57553 08/11/2019 03:57 PM Sandro La Bruzzo

fixed valid date Actions

57544 07/11/2019 06:11 PM Claudio Atzori

aligned infospace exporter with dhp-schema:1.0.4

57538 07/11/2019 01:04 PM Michele De Bonis

create branch for tree dedup

57533 06/11/2019 03:50 PM Claudio Atzori

fixed infospace export procedure, avoid emitting the same result more than once

57520 05/11/2019 05:26 PM Claudio Atzori

fixed infospace export procedure, avoid emitting the same result more than once

57517 05/11/2019 04:44 PM Michele Artini

OpenOrgs DB: use of tsv for rels

57513 05/11/2019 03:58 PM Claudio Atzori

fixed infospace export procedure, avoid emitting the same result more than once

57510 05/11/2019 12:19 PM Michele Artini

deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files

57509 05/11/2019 10:53 AM Claudio Atzori

context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier

57508 04/11/2019 03:10 PM Claudio Atzori

fixed export procedure: include also relationships

57507 04/11/2019 02:02 PM Claudio Atzori

added dhp mapping test

57505 04/11/2019 12:02 PM Claudio Atzori

included infospace mapping towards the new OAF DHP model

57502 04/11/2019 09:57 AM Sandro La Bruzzo

Added new mapper

57472 29/10/2019 09:23 PM Claudio Atzori

avoid printing the job configuration in the setup phase

57450 24/10/2019 12:00 PM Michele Artini

use of dnet-pace-core 3.0.15

57443 22/10/2019 02:59 PM Claudio Atzori

put operations WAS SYNC

57432 21/10/2019 04:41 PM Miriam Baglioni

Added new propagation constants for the ORCID propagation

57397 17/10/2019 11:14 AM Michele Artini

(openorgs) added schemeid to pids in sql query

57379 15/10/2019 05:41 PM Claudio Atzori

pick the first mergedIn identifier

57322 08/10/2019 06:04 PM Claudio Atzori

using streams instead of guava collection transformations

57270 04/10/2019 03:41 PM Enrico Ottonello

added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation

57231 01/10/2019 05:18 PM Claudio Atzori

exclude the mapping-utils version inherited from transitive dependencies

57211 01/10/2019 11:45 AM Miriam Baglioni

final logic of propagation of community through organization (products belonging to given organization will be associated to the community)

57203 30/09/2019 02:27 PM Alessia Bardi

Testing sygma

57193 30/09/2019 11:38 AM Sandro La Bruzzo

fixed test

57188 30/09/2019 11:01 AM Claudio Atzori

removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it

57186 30/09/2019 10:45 AM Sandro La Bruzzo

fixed problem on subject null in scholixToActions

57162 26/09/2019 04:18 PM Alessia Bardi

Test for checking: #4911 (missing collectedfrom/hostedby identifiers)

57092 18/09/2019 04:17 PM Alessia Bardi

Testing guidelines 4 with current odf2hbase mapping (repo: Aria)

57091 18/09/2019 04:06 PM Alessia Bardi

tests for qeios

56886 09/08/2019 11:50 AM Miriam Baglioni

fix

56885 09/08/2019 11:41 AM Miriam Baglioni

fixed issue

56884 09/08/2019 10:52 AM Miriam Baglioni

moved resolution of the organization-communities mapping to the mapper

56878 07/08/2019 05:35 PM Miriam Baglioni

removed try catch for key validity

56876 07/08/2019 05:22 PM Claudio Atzori

using Text as grouping key type between mapper and reducer

56875 07/08/2019 04:05 PM Miriam Baglioni

fixed error

56872 07/08/2019 03:31 PM Miriam Baglioni

not needed class

56871 07/08/2019 03:29 PM Miriam Baglioni

new tests and resources

56870 07/08/2019 03:28 PM Miriam Baglioni

implementation of propagation of result to community through organization

56869 07/08/2019 03:17 PM Miriam Baglioni

added new util

56868 07/08/2019 03:16 PM Miriam Baglioni

refactor and cleaning

56867 07/08/2019 03:16 PM Miriam Baglioni

refactor for the classname

56866 07/08/2019 03:15 PM Miriam Baglioni

added classname information. Save the same context just once for the update in h-base

56865 07/08/2019 03:12 PM Miriam Baglioni

added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants

56849 05/08/2019 06:34 PM Claudio Atzori

use group max size from the wf configuration

56676 22/07/2019 04:08 PM Claudio Atzori

added mapper to filter XML (index) records according to a set of criteria

56584 17/07/2019 12:06 PM Claudio Atzori

map only job that integrates the body updates before exporting them

56507 12/07/2019 12:40 PM Miriam Baglioni

fixed type issue

56499 12/07/2019 11:03 AM Miriam Baglioni

[maven-release-plugin] prepare for next development iteration

56498 12/07/2019 11:03 AM Miriam Baglioni

[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.2.0

56497 12/07/2019 11:03 AM Miriam Baglioni

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.2.0

56495 12/07/2019 10:19 AM Miriam Baglioni

new test

56491 12/07/2019 10:13 AM Miriam Baglioni

used class to extend ArrayList<String>

56451 10/07/2019 09:47 AM Miriam Baglioni

updated test and added test files

56442 09/07/2019 03:25 PM Miriam Baglioni

removed thrown exception when path not found in json

56440 09/07/2019 03:15 PM Miriam Baglioni

correct serialization of proto as json

56425 09/07/2019 12:03 AM Miriam Baglioni

added class that extends hashMap<String,String> not to need reflection

56394 08/07/2019 12:45 PM Miriam Baglioni

naming refactor

56385 08/07/2019 09:56 AM Miriam Baglioni

modified test for changed implementation of deserialization of map from json

56384 08/07/2019 09:55 AM Miriam Baglioni

Alternative implementation w.r.t. reflection for deserializing map from json

56279 30/06/2019 09:49 AM Claudio Atzori

NPE check on publisher ugly hack

56246 27/06/2019 06:42 PM Miriam Baglioni

Adding new common methods for propagation of community to result through semantic relation

56245 27/06/2019 06:41 PM Miriam Baglioni

Update of propagation constants for propagation of community to result through semantic relation

56244 27/06/2019 06:40 PM Miriam Baglioni

propagation of community to result through semantic relation

56214 27/06/2019 11:01 AM Claudio Atzori

print the dedup config string before parsing it

56204 26/06/2019 11:11 AM Claudio Atzori

prefixes must have length = 12

56168 21/06/2019 03:11 PM Enrico Ottonello

modified generated json for proto structure test

56167 21/06/2019 03:10 PM Enrico Ottonello

added some fields to generated json

56166 21/06/2019 03:09 PM Enrico Ottonello

added other counters

56146 20/06/2019 07:40 PM Alessia Bardi

Fixes #4362: Scielo is an Open Access Publisher

56145 20/06/2019 07:30 PM Alessia Bardi

Instance from Crossref restricted by default instead of closed

56144 20/06/2019 07:28 PM Alessia Bardi

RESTRICTED instead of CLOSED, fixed access mode names

56143 20/06/2019 06:55 PM Alessia Bardi

Fixes #4562 (orcid format)

56141 20/06/2019 06:02 PM Alessia Bardi

Added case for invalid author