# Date Author Comment
57767 29/11/2019 01:06 PM Michele De Bonis

update in the generation of the master index

57656 21/11/2019 10:00 AM Michele De Bonis

set ignore file

57655 21/11/2019 09:58 AM Michele De Bonis

modification to fit with the tree-dedup

57538 07/11/2019 01:04 PM Michele De Bonis

create branch for tree dedup

55437 29/04/2019 02:01 PM Alessia Bardi

Updated DOIBoost test as they are in the trunk branch

55436 29/04/2019 01:43 PM Alessia Bardi

Updated classes for DOIBoost based on the trunk version

54922 27/02/2019 04:35 PM Michele De Bonis

implementation of the procedure to export native software on hdfs
addition of needed workflows and classes

54717 07/02/2019 03:51 PM Miriam Baglioni
54716 07/02/2019 03:18 PM Miriam Baglioni

actually incrementing the counters just added

54715 06/02/2019 04:09 PM Miriam Baglioni

added counters for matching subject, content providers and zenodo community for each community

54576 16/01/2019 04:08 PM Miriam Baglioni

test for the new implementation of zenodo community value

54575 16/01/2019 04:02 PM Miriam Baglioni

changed to mirror the change in the zenodo community value: it is now the zenodo community instead of the openaire community associated to a result. The context of the zenodo community is removed from those of the result if the zenodo community is not associated to any openaire community.

54505 28/12/2018 02:57 PM Claudio Atzori

replaced CrossRef with Crossref

54504 28/12/2018 02:57 PM Claudio Atzori

depending on released parent, updated dependency dnet-openaireplus-mapping-utils to most recent release

54284 10/12/2018 04:00 PM Claudio Atzori

bumped version dep dnet-openaireplus-mapping-utils:6.2.25

54187 06/12/2018 04:08 PM Claudio Atzori

fixed creation of the coordinates (columnFamily) driving the Put operation

54186 06/12/2018 04:07 PM Claudio Atzori

less log verbosity

54182 06/12/2018 11:01 AM Claudio Atzori

fixRelations must work on main entities in a single scan pass

54175 05/12/2018 05:04 PM Claudio Atzori

added support for simulation mode: leaves the data unchanged and keeps track of summary counters

54173 05/12/2018 04:41 PM Claudio Atzori

reverted version of mapping-utils dependency

54172 05/12/2018 04:35 PM Claudio Atzori

bumped version of mapping-utils dependency

54167 05/12/2018 03:57 PM Claudio Atzori

[maven-release-plugin] prepare for next development iteration

54165 05/12/2018 03:57 PM Claudio Atzori

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.12-MASTER

54164 05/12/2018 03:56 PM Claudio Atzori

introduced jobs to fix the relationships among deduped records. Got rid of deprecations on the HBase Put method usage

54125 04/12/2018 05:09 PM Miriam Baglioni

refactoring and fixed issue for empty project list

54093 03/12/2018 05:43 PM Miriam Baglioni

Added new check for result type checking inside list of results in country propagation

54085 03/12/2018 05:01 PM Miriam Baglioni

fixed issue for empty result in the list of results to which the country is propagated (last fix produced a bug)

54084 03/12/2018 05:00 PM Miriam Baglioni

changed message in context

54073 03/12/2018 11:40 AM Claudio Atzori

bumped version after fixing bug for list of result(50|) empty - Propagation

54072 03/12/2018 11:39 AM Miriam Baglioni

fixed bug for list of result(50|) empty

54070 03/12/2018 11:11 AM Miriam Baglioni

Propagation general Iterator and Constants class

54069 03/12/2018 11:03 AM Miriam Baglioni

Iterator to handle results from mapper for project propagation

54068 03/12/2018 11:02 AM Miriam Baglioni

Reducers for project propagation to File and to HBase

54067 03/12/2018 11:01 AM Miriam Baglioni

Iterator for managing values from mapper for result country propagation

54064 29/11/2018 05:59 PM Miriam Baglioni

fixed issue

54045 29/11/2018 12:29 PM Miriam Baglioni
54041 29/11/2018 12:15 PM Claudio Atzori

using most recent release of dnet-openaireplus-mapping-utils

54040 29/11/2018 12:07 PM Miriam Baglioni

new Exception class for not valid list of values in reducer

54039 29/11/2018 12:06 PM Miriam Baglioni

modified type for type variable. From int to Type

54038 29/11/2018 12:05 PM Miriam Baglioni

added new variables and methods

54037 29/11/2018 12:04 PM Miriam Baglioni

updated implementation to use iterator

54036 29/11/2018 12:01 PM Miriam Baglioni

specific iterator for country propagation

54035 29/11/2018 12:01 PM Miriam Baglioni

Added generic Iterator over the list of information gathered from the reducer

54012 28/11/2018 02:24 PM Sandro La Bruzzo

fix error on parsing abstract

54010 28/11/2018 10:59 AM Miriam Baglioni

updated to consider the new class Utils and the new values in Propagation Constants classes

54009 28/11/2018 10:55 AM Miriam Baglioni

updated to store properties of the value

54008 28/11/2018 10:54 AM Miriam Baglioni

class to contain the methods common to the propagation classes among the propagation types

54007 28/11/2018 10:53 AM Miriam Baglioni

update of the constants for the propagation

54006 28/11/2018 10:51 AM Miriam Baglioni

map-reduce to implement the propagation of the association between project and result through the existence of semantic relations

54005 28/11/2018 10:48 AM Miriam Baglioni

map to count the number of results having a semantic relation of type isSupplementedBy that also have a relation isProducedBy with at least one project

53983 27/11/2018 12:06 PM Claudio Atzori

added test to verify the parsing of EPMC record

53883 19/11/2018 05:43 PM Claudio Atzori

depending on dnet-openaireplus-mapping-utils:6.2.21

53779 15/11/2018 12:25 PM Claudio Atzori

avoid importing unnecessary affiliations from DOIBoost

53763 14/11/2018 05:28 PM Alessia Bardi

depend on mappingutils 6.2.19 for better mapping of software

53762 14/11/2018 05:24 PM Alessia Bardi

[maven-release-plugin] prepare for next development iteration

53760 14/11/2018 05:24 PM Alessia Bardi

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.10-MASTER

53756 14/11/2018 05:13 PM Alessia Bardi

Testing software from biotools

53737 13/11/2018 05:21 PM Sandro La Bruzzo

fixed bug for uncompressed abstract in DOIBoostToAction

53736 13/11/2018 04:55 PM Sandro La Bruzzo

changed mapping for compressed abstract in DOIBoostToAction

53726 13/11/2018 09:08 AM Claudio Atzori

less verbose logging

53721 12/11/2018 03:55 PM Claudio Atzori

excluded clashing versions of jackson. We want to keep ours

53720 12/11/2018 03:54 PM Claudio Atzori

using proper import for LogFactory

53718 12/11/2018 12:00 PM Claudio Atzori

[maven-release-plugin] prepare for next development iteration

53716 12/11/2018 12:00 PM Claudio Atzori

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.9-MASTER

53715 12/11/2018 11:51 AM Claudio Atzori

using most recent dnet-openaireplus-mapping-utils version (6.2.18)

53710 09/11/2018 07:01 PM Claudio Atzori

added Mapper and Reducer class for infoSpace counts workflows

53687 09/11/2018 02:17 PM Claudio Atzori

added reducer to produce counts on the infospace

53684 09/11/2018 12:28 PM Claudio Atzori

added reducer to produce counts on the infospace

53628 05/11/2018 04:41 PM Claudio Atzori

cannot use Guava's Splitter.splitToList, must stick to basic split method. Classpath is messed up

53604 31/10/2018 11:34 AM Claudio Atzori

implemented ConfigurableExportMapper

53602 31/10/2018 10:36 AM Sandro La Bruzzo

Removed un-used import

53592 30/10/2018 12:36 PM Sandro La Bruzzo

fixed test

53591 30/10/2018 12:30 PM Sandro La Bruzzo

updated Mapper to return the whole invalid record

53589 30/10/2018 12:27 PM Claudio Atzori

export invalid xml records

53588 30/10/2018 12:26 PM Sandro La Bruzzo

refactored Action

53572 29/10/2018 10:12 AM Claudio Atzori

Map only job that produces [openaireId, doi] pairs of records containing invalid characters

53571 29/10/2018 10:10 AM Claudio Atzori

reduced file size

53565 26/10/2018 03:40 PM Sandro La Bruzzo

added parameter to filter only organization in DOIBoostToAction

53556 25/10/2018 04:30 PM Sandro La Bruzzo

fixed problem of missing name in authors

53554 25/10/2018 12:26 PM Sandro La Bruzzo

merged beta branch to master

53546 23/10/2018 03:45 PM Sandro La Bruzzo

Change Mapper to implement DOIBoostToAction

53544 23/10/2018 03:41 PM Sandro La Bruzzo

Created DOIBoostToActions Mapping

53518 18/10/2018 02:48 PM Claudio Atzori

introduced use of BlockProcessor

53467 15/10/2018 03:23 PM Claudio Atzori

[maven-release-plugin] prepare for next development iteration

53465 15/10/2018 03:23 PM Claudio Atzori

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.8-MASTER

53461 15/10/2018 12:57 PM Claudio Atzori

updated test for hbase mapping for organizations

53437 10/10/2018 06:43 PM Claudio Atzori

bumped version, dnet-openaireplus-mapping-utils:6.2.15 should fix the unmapped instancetype terms

53436 10/10/2018 06:39 PM Claudio Atzori

updated dnet-openaireplus-mapping-utils dependency version

53421 09/10/2018 11:55 AM Miriam Baglioni

fixed issue when country information is not present for datasource

53419 08/10/2018 10:14 AM Miriam Baglioni

replaced throwing of exceptions with counters

53418 06/10/2018 07:49 PM Claudio Atzori

using updated mapping-utils module

53410 05/10/2018 04:18 PM Miriam Baglioni

change parameter from ImmutableBytesWritable to Text

53409 05/10/2018 04:17 PM Miriam Baglioni

refactoring and change of counters

53408 05/10/2018 04:09 PM Claudio Atzori

using updated mapping-utils module, added unit test to check the merge procedure for context and country updates

53407 05/10/2018 04:06 PM Claudio Atzori

rollback wrong commit

53386 04/10/2018 03:35 PM Claudio Atzori

fixing and testing propagation implementation

53383 04/10/2018 02:46 PM Miriam Baglioni

reducer for country propagation that writes on hdfs

53380 03/10/2018 04:19 PM Miriam Baglioni
53371 03/10/2018 10:22 AM Claudio Atzori

cleanup pid types in order to make them valid attributes

53369 02/10/2018 02:08 PM Miriam Baglioni