Project

General

Profile

Statistics
| Revision:

# Date Author Comment
57925 20/12/2019 10:42 AM Michele Artini
57919 19/12/2019 10:17 AM Claudio Atzori

using 'jar-with-dependencies' suffix for the uber jar

57918 19/12/2019 10:10 AM Claudio Atzori

added deploy.info descriptor, using 'assembly' suffix for the uber jar

57917 19/12/2019 09:54 AM Claudio Atzori

removed useless logs, they might clog the cluster resources

57838 09/12/2019 11:05 AM Michele De Bonis

addition of the trust in the mapping of the datainfo

57828 05/12/2019 05:40 PM Miriam Baglioni

test for propagation of result to organization both via datasource and via semantic relation

57827 05/12/2019 05:32 PM Miriam Baglioni

refactoring

57786 03/12/2019 10:05 AM Miriam Baglioni

code refactoring

57785 03/12/2019 10:04 AM Miriam Baglioni

refactoring

57783 03/12/2019 09:49 AM Claudio Atzori

code formatting

57781 02/12/2019 05:00 PM Claudio Atzori

exclude the assembly creation stage from the package phase, the uber jar will be uploaded on PPA

57780 02/12/2019 04:44 PM Claudio Atzori

removed deploy.info. This module will be managed with a dedicated build job

57779 02/12/2019 03:52 PM Claudio Atzori

include the assembly in the package phase

57778 02/12/2019 03:18 PM Miriam Baglioni

fixed bug: mapper for organizations sent wrong information about the organization id. It was the id of the datasource that was emitted instead of the one of the organization

57775 02/12/2019 11:40 AM Alessia Bardi

Test to check the good hostedby is considered by the mapping

57774 02/12/2019 11:38 AM Alessia Bardi

Handling empty values for float fields like project totalcosts

57767 29/11/2019 01:06 PM Michele De Bonis

update in the generation of the master index

57766 29/11/2019 12:49 PM Claudio Atzori

rehash the entire openaire id when building the root identifier to avoid clashes

57758 28/11/2019 08:12 PM Alessia Bardi

#4961: ensure we properly build XML records of projects and orgs with summary and budget information

57749 28/11/2019 03:29 PM Claudio Atzori

avoid NPEs mapping DOIBoost records

57724 27/11/2019 05:59 PM Miriam Baglioni

refactoring

57723 27/11/2019 05:58 PM Miriam Baglioni

update of the propagation constant for the propagation of organization relation for projects belonging to a ds related to the org

57722 27/11/2019 05:57 PM Miriam Baglioni

propagation of relationship between products of a datasource and the organization(s) the datasource is related to. The propagation is done if the ds in an institutional repository and if the result is not already linked to the organization. No addition to the provenance of the relation is added in case it would already be present

57721 27/11/2019 05:55 PM Miriam Baglioni

refactoring

57717 27/11/2019 04:24 PM Miriam Baglioni

refactor needed because the InstOrgKey class has been moved to dedicated package

57716 27/11/2019 04:09 PM Miriam Baglioni

update of the propagation constant to fit also the propagation of author affiliation

57715 27/11/2019 04:08 PM Miriam Baglioni

propagation of affiliation relation (hasAuthorInstitution) from result to organization through results linked by strong semantic relations. Results already associated to the organization will not have the hasAuthorInstitution relation overwritten

57714 27/11/2019 04:06 PM Miriam Baglioni

logic of the compositekeys for propagation moved on dedicated package. Ordering datasource, organization, results on the key that groups for the reducer

57713 27/11/2019 04:05 PM Miriam Baglioni

try to fix issue with svn

57712 27/11/2019 03:50 PM Miriam Baglioni

logic of the compositekeys for propagation moved on dedicated package. Ordering datasource, organization, results on the key that groups for the reducer

57711 27/11/2019 03:48 PM Miriam Baglioni

moved the implementation of the composite keys in a dedicated package since it is used in mode than one propagation

57709 27/11/2019 02:43 PM Claudio Atzori

better handling of side cases

57699 27/11/2019 10:03 AM Sandro La Bruzzo

fixed null abstract

57656 21/11/2019 10:00 AM Michele De Bonis

set ignore file

57655 21/11/2019 09:58 AM Michele De Bonis

modification to fit with the tree-dedup

57634 19/11/2019 11:24 AM Claudio Atzori

[broker] factored out method to obtain the key to be emitted by the enrichment map phase

57633 18/11/2019 10:37 AM Claudio Atzori

ORCID events are not yet ready for production

57632 18/11/2019 10:01 AM Claudio Atzori

limit the dnet-pace-core dependency to use version 3.0.14, i.e. prior to the introduction of the translation map in the configuration

57589 12/11/2019 03:28 PM Miriam Baglioni

refactoring. changed name to constant

57583 11/11/2019 06:22 PM Miriam Baglioni

Test for propagation of result to community through organization

57582 11/11/2019 06:21 PM Miriam Baglioni

Test for ORCID propagation to result through semantic relation

57581 11/11/2019 06:08 PM Miriam Baglioni

update of propagation constant for orcid propagation and propagation of product to organization through semantic relation

57580 11/11/2019 06:07 PM Miriam Baglioni

mapreduce job for the propagation of ORCID through result. Follows only isSupplementedBy, isSupplementTo semantic relations

57560 11/11/2019 11:37 AM Miriam Baglioni

Added one counter to count the number of results per community

57556 08/11/2019 05:07 PM Claudio Atzori

DOIBOOST mapping: include dates formatted as \d{4}-\d{1,2}-\d{1,2}, discard records not providing at least one date

57553 08/11/2019 03:57 PM Sandro La Bruzzo

fixed valid date Actions

57544 07/11/2019 06:11 PM Claudio Atzori

aligned infospace exporter with dhp-schema:1.0.4

57538 07/11/2019 01:04 PM Michele De Bonis

create branch for tree dedup

57533 06/11/2019 03:50 PM Claudio Atzori

fixed infospace export procedure, avoid to emit the same result more than once

57520 05/11/2019 05:26 PM Claudio Atzori

fixed infospace export procedure, avoid to emit the same result more than once

57517 05/11/2019 04:44 PM Michele Artini

OpenOrgs DB: use of tsv for rels

57513 05/11/2019 03:58 PM Claudio Atzori

fixed infospace export procedure, avoid to emit the same result more than once

57510 05/11/2019 12:19 PM Michele Artini

deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files

57509 05/11/2019 10:53 AM Claudio Atzori

context propagation of projects rels through semantic rels: fixed field assignment when building the relation qualifier

57508 04/11/2019 03:10 PM Claudio Atzori

fixed export procedure: include also relationships

57507 04/11/2019 02:02 PM Claudio Atzori

added dhp mapping test

57505 04/11/2019 12:02 PM Claudio Atzori

included infospace mapping towards the new OAF DHP model

57502 04/11/2019 09:57 AM Sandro La Bruzzo

Added new mapper

57472 29/10/2019 09:23 PM Claudio Atzori

avoid to print the job configuration in the setup phase

57450 24/10/2019 12:00 PM Michele Artini

use of dnet-pace-core 3.0.15

57443 22/10/2019 02:59 PM Claudio Atzori

put operations WAS SYNC

57432 21/10/2019 04:41 PM Miriam Baglioni

Added new propagation constants for the ORCID propagation

57397 17/10/2019 11:14 AM Michele Artini

(openorgs) added schemeid to pids in sql query

57379 15/10/2019 05:41 PM Claudio Atzori

pick the first mergedIn identifier

57322 08/10/2019 06:04 PM Claudio Atzori

using streams instead of guava collection transformations

57270 04/10/2019 03:41 PM Enrico Ottonello

added firstname and surname in case of authors without orcid id, extracted from fullname, needed for propagation

57231 01/10/2019 05:18 PM Claudio Atzori

exclude the mapping-utils version inherited from transitive dependencies

57211 01/10/2019 11:45 AM Miriam Baglioni

final logic of propagation of community through organization (products belonging to given organization will be associated to the community)

57203 30/09/2019 02:27 PM Alessia Bardi

Testing sygma

57193 30/09/2019 11:38 AM Sandro La Bruzzo

fixed test

57188 30/09/2019 11:01 AM Claudio Atzori

removed project reference from src/test/resources/eu/dnetlib/data/transform/odf.xml, the test didn't include any check against it

57186 30/09/2019 10:45 AM Sandro La Bruzzo

fixed problem on subject null in scholixToActions

57162 26/09/2019 04:18 PM Alessia Bardi

Test for checking: #4911 (missing collectedfrom/hostedby identifiers)

57092 18/09/2019 04:17 PM Alessia Bardi

Testing guidelines 4 with current odf2hbase mapping (repo: Aria)

57091 18/09/2019 04:06 PM Alessia Bardi

tests for qeios

56886 09/08/2019 11:50 AM Miriam Baglioni

fix

56885 09/08/2019 11:41 AM Miriam Baglioni

fixed issue

56884 09/08/2019 10:52 AM Miriam Baglioni

moved resolution of mapping organization- communities to mapper

56878 07/08/2019 05:35 PM Miriam Baglioni

removed try catch for key validity

56876 07/08/2019 05:22 PM Claudio Atzori

using Text as grouping key type between mapper and reducer

56875 07/08/2019 04:05 PM Miriam Baglioni

fixed error

56872 07/08/2019 03:31 PM Miriam Baglioni

not needed class

56871 07/08/2019 03:29 PM Miriam Baglioni

new tests and resources

56870 07/08/2019 03:28 PM Miriam Baglioni

implementation of propagation of result to community through organization

56869 07/08/2019 03:17 PM Miriam Baglioni

added new util

56868 07/08/2019 03:16 PM Miriam Baglioni

refactor and cleaning

56867 07/08/2019 03:16 PM Miriam Baglioni

refactor for the classname

56866 07/08/2019 03:15 PM Miriam Baglioni

added classname information. Save the same context just once for the update in h-base

56865 07/08/2019 03:12 PM Miriam Baglioni

added class_name information to discriminate from bulktagging reasons. Added class to gather the bulktagging constants

56849 05/08/2019 06:34 PM Claudio Atzori

use group max size from the wf configuration

56676 22/07/2019 04:08 PM Claudio Atzori

added mapper to filter XML (index) records according to a set of criteria

56584 17/07/2019 12:06 PM Claudio Atzori

map only job that integrates the body updates before exporting them

56507 12/07/2019 12:40 PM Miriam Baglioni

fixed type issue

56499 12/07/2019 11:03 AM Miriam Baglioni

[maven-release-plugin] prepare for next development iteration

56498 12/07/2019 11:03 AM Miriam Baglioni

[maven-release-plugin] copy for tag dnet-mapreduce-jobs-1.2.0

56497 12/07/2019 11:03 AM Miriam Baglioni

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.2.0

56495 12/07/2019 10:19 AM Miriam Baglioni

new test

56491 12/07/2019 10:13 AM Miriam Baglioni

used class to extend ArrayList<String>

56451 10/07/2019 09:47 AM Miriam Baglioni

updated test and added test files

56442 09/07/2019 03:25 PM Miriam Baglioni

removed thrown exception when path not found in json