# Date Author Comment
57713 27/11/2019 16:05 Miriam Baglioni

try to fix issue with svn

57712 27/11/2019 15:50 Miriam Baglioni

moved the composite-key logic for propagation to a dedicated package. Datasources, organizations, and results are ordered on the key used for grouping in the reducer

57711 27/11/2019 15:48 Miriam Baglioni

moved the implementation of the composite keys to a dedicated package, since it is used in more than one propagation

57709 27/11/2019 14:43 Claudio Atzori

better handling of edge cases

57699 27/11/2019 10:03 Sandro La Bruzzo

fixed null abstract

57634 19/11/2019 11:24 Claudio Atzori

[broker] factored out method to obtain the key to be emitted by the enrichment map phase

57633 18/11/2019 10:37 Claudio Atzori

ORCID events are not yet ready for production

57632 18/11/2019 10:01 Claudio Atzori

limit the dnet-pace-core dependency to use version 3.0.14, i.e. prior to the introduction of the translation map in the configuration

57589 12/11/2019 15:28 Miriam Baglioni

refactoring. changed name to constant

57581 11/11/2019 18:08 Miriam Baglioni

update of propagation constant for orcid propagation and propagation of product to organization through semantic relation

57580 11/11/2019 18:07 Miriam Baglioni

mapreduce job for the propagation of ORCID through result. Follows only isSupplementedBy, isSupplementTo semantic relations

57560 11/11/2019 11:37 Miriam Baglioni

Added one counter to count the number of results per community

57556 08/11/2019 17:07 Claudio Atzori

DOIBOOST mapping: include dates formatted as \d{4}-\d{1,2}-\d{1,2}, discard records not providing at least one date

57553 08/11/2019 15:57 Sandro La Bruzzo

fixed valid date Actions

57544 07/11/2019 18:11 Claudio Atzori

aligned infospace exporter with dhp-schema:1.0.4

57533 06/11/2019 15:50 Claudio Atzori

fixed infospace export procedure: avoid emitting the same result more than once

57520 05/11/2019 17:26 Claudio Atzori

fixed infospace export procedure: avoid emitting the same result more than once

57517 05/11/2019 16:44 Michele Artini

OpenOrgs DB: use of tsv for rels

57513 05/11/2019 15:58 Claudio Atzori

fixed infospace export procedure: avoid emitting the same result more than once

57510 05/11/2019 12:19 Michele Artini

deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files

57509 05/11/2019 10:53 Claudio Atzori

context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier

57508 04/11/2019 15:10 Claudio Atzori

fixed export procedure: include also relationships

57507 04/11/2019 14:02 Claudio Atzori

added dhp mapping test

57505 04/11/2019 12:02 Claudio Atzori

included infospace mapping towards the new OAF DHP model

57502 04/11/2019 09:57 Sandro La Bruzzo

Added new mapper

57472 29/10/2019 21:23 Claudio Atzori

avoid printing the job configuration in the setup phase

57450 24/10/2019 12:00 Michele Artini

use of dnet-pace-core 3.0.15

57443 22/10/2019 14:59 Claudio Atzori

put operations WAS SYNC

57432 21/10/2019 16:41 Miriam Baglioni

Added new propagation constants for the ORCID propagation

57379 15/10/2019 17:41 Claudio Atzori

pick the first mergedIn identifier

57322 08/10/2019 18:04 Claudio Atzori

using streams instead of guava collection transformations

57270 04/10/2019 15:41 Enrico Ottonello

added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation

57211 01/10/2019 11:45 Miriam Baglioni

final logic of propagation of community through organization (products belonging to a given organization will be associated with the community)

57193 30/09/2019 11:38 Sandro La Bruzzo

fixed test

57188 30/09/2019 11:01 Claudio Atzori

removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it

57186 30/09/2019 10:45 Sandro La Bruzzo

fixed problem with null subject in scholixToActions

56886 09/08/2019 11:50 Miriam Baglioni

fix

56885 09/08/2019 11:41 Miriam Baglioni

fixed issue

56884 09/08/2019 10:52 Miriam Baglioni

moved resolution of the organization-communities mapping to the mapper

56878 07/08/2019 17:35 Miriam Baglioni

removed try catch for key validity

56876 07/08/2019 17:22 Claudio Atzori

using Text as grouping key type between mapper and reducer

56875 07/08/2019 16:05 Miriam Baglioni

fixed error

56872 07/08/2019 15:31 Miriam Baglioni

not needed class

56870 07/08/2019 15:28 Miriam Baglioni

implementation of propagation of result to community through organization

56869 07/08/2019 15:17 Miriam Baglioni

added new util

56868 07/08/2019 15:16 Miriam Baglioni

refactor and cleaning

56867 07/08/2019 15:16 Miriam Baglioni

refactor for the classname

56866 07/08/2019 15:15 Miriam Baglioni

added classname information. Save the same context just once for the update in HBase

56865 07/08/2019 15:12 Miriam Baglioni

added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants

56849 05/08/2019 18:34 Claudio Atzori

use group max size from the wf configuration

56676 22/07/2019 16:08 Claudio Atzori

added mapper to filter XML (index) records according to a set of criteria

56584 17/07/2019 12:06 Claudio Atzori

map-only job that integrates the body updates before exporting them

56507 12/07/2019 12:40 Miriam Baglioni

fixed type issue

56491 12/07/2019 10:13 Miriam Baglioni

used class to extend ArrayList<String>

56442 09/07/2019 15:25 Miriam Baglioni

removed thrown exception when path not found in json

56440 09/07/2019 15:15 Miriam Baglioni

correct serialization of proto as json

56425 09/07/2019 00:03 Miriam Baglioni

added class that extends HashMap<String,String> to avoid the need for reflection

56394 08/07/2019 12:45 Miriam Baglioni

naming refactor

56384 08/07/2019 09:55 Miriam Baglioni

Alternative implementation w.r.t. reflection for deserializing map from json

56279 30/06/2019 09:49 Claudio Atzori

NPE check on publisher (ugly hack)

56246 27/06/2019 18:42 Miriam Baglioni

Adding new common methods for propagation of community to result through semantic relation

56245 27/06/2019 18:41 Miriam Baglioni

Update of propagation constants for propagation of community to result through semantic relation

56244 27/06/2019 18:40 Miriam Baglioni

propagation of community to result through semantic relation

56214 27/06/2019 11:01 Claudio Atzori

print the dedup config string before parsing it

56204 26/06/2019 11:11 Claudio Atzori

prefixes must have length = 12

56167 21/06/2019 15:10 Enrico Ottonello

added some fields to generated json

56166 21/06/2019 15:09 Enrico Ottonello

added other counters

56146 20/06/2019 19:40 Alessia Bardi

Fixes #4362: Scielo is an Open Access Publisher

56145 20/06/2019 19:30 Alessia Bardi

Instance from Crossref restricted by default instead of closed

56144 20/06/2019 19:28 Alessia Bardi

RESTRICTED instead of CLOSED, fixed access mode names

56143 20/06/2019 18:55 Alessia Bardi

Fixes #4562 (orcid format)

56141 20/06/2019 18:02 Alessia Bardi

Added case for invalid author

56135 20/06/2019 12:36 Alessia Bardi

Another test publisher

56134 20/06/2019 11:56 Alessia Bardi

More cases to discard a record for test authors

56133 20/06/2019 11:38 Alessia Bardi

Fix #4637 and improve check for invalid authors

56120 19/06/2019 17:27 Claudio Atzori

software link export job

56098 18/06/2019 14:49 Claudio Atzori

added M/R to export publication-software links

56095 14/06/2019 15:54 Miriam Baglioni

added openaire id to publication

56088 14/06/2019 14:41 Miriam Baglioni
56087 14/06/2019 14:41 Miriam Baglioni
56084 14/06/2019 14:35 Miriam Baglioni

minor (constant has been renamed)

56083 14/06/2019 14:33 Miriam Baglioni

Starting the implementation for propagation of result to community. A result linked by isSupplementedBy to a result associated with the community is itself linked to the community

56082 14/06/2019 14:31 Miriam Baglioni

update for propagation community-result

56081 14/06/2019 14:30 Miriam Baglioni

added logic for selection criteria implementation on datasources

56079 14/06/2019 12:48 Miriam Baglioni

export of link between publication and software (related to #4593)

56016 06/06/2019 16:53 Enrico Ottonello

56011 06/06/2019 12:12 Alessia Bardi

print json but commented out.
Added test to get a proto from a json

55990 04/06/2019 18:10 Enrico Ottonello

added resourcetype and resulttype, new work type mapping

55895 29/05/2019 10:30 Enrico Ottonello

code cleaning

55887 28/05/2019 18:35 Alessia Bardi

code adapted to new version of mapping utils 6.3.25 which supports journal information also on data sources

55844 28/05/2019 16:20 Enrico Ottonello

this mapper handles orcid publications (without doi) actions

55522 07/05/2019 18:31 Enrico Ottonello

generate actionset for orcid works and unit tests

55349 15/04/2019 10:23 Alessia Bardi

Avoid nullpointer for publisher

55246 09/04/2019 18:06 Alessia Bardi

Discard records without a valid author as requested in #4392, #4393, #4395, #4396.
If the record also has at least one valid author, the record is kept but the invalid authors are removed.

55239 08/04/2019 19:08 Alessia Bardi

Workaround for #4362: instances from Unpaywall mapped into licenses and then lost

55238 08/04/2019 18:08 Alessia Bardi

Addressing quality of the research graph: #4368 and #4360.

55144 01/04/2019 16:53 Miriam Baglioni

add provenance to bulktagging

55076 22/03/2019 18:26 Alessia Bardi

updated label for a counter that I hope I will never see

54872 20/02/2019 15:05 Miriam Baglioni

removed useless comment

54871 20/02/2019 14:57 Miriam Baglioni

refactoring and update of default variable values