# Date Author Comment
55437 29/04/2019 02:01 PM Alessia Bardi

Updated DOIBoost tests to match the trunk branch

55436 29/04/2019 01:43 PM Alessia Bardi

Updated classes for DOIBoost based on the trunk version

54717 07/02/2019 03:51 PM Miriam Baglioni
54716 07/02/2019 03:18 PM Miriam Baglioni

actually incrementing the counters just added

54715 06/02/2019 04:09 PM Miriam Baglioni

added counters for matching subject, content providers and zenodo community for each community

54576 16/01/2019 04:08 PM Miriam Baglioni

test for the new implementation of zenodo community value

54575 16/01/2019 04:02 PM Miriam Baglioni

changed to mirror the change in the zenodo community value: the zenodo community is used instead of the openaire community associated with a result. The context of the zenodo community will be removed from those of the result if the zenodo community is not associated with any openaire community.

54505 28/12/2018 02:57 PM Claudio Atzori

replaced CrossRef with Crossref

54187 06/12/2018 04:08 PM Claudio Atzori

fixed creation of the coordinates (columnFamily) driving the Put operation

54186 06/12/2018 04:07 PM Claudio Atzori

less log verbosity

54182 06/12/2018 11:01 AM Claudio Atzori

fixRelations must work on main entities in a single scan pass

54175 05/12/2018 05:04 PM Claudio Atzori

added support for simulation mode: leaves the data unchanged while keeping track of summary counters

54164 05/12/2018 03:56 PM Claudio Atzori

introduced jobs to fix the relationships among deduped records. Got rid of deprecations on the HBase Put method usage

54125 04/12/2018 05:09 PM Miriam Baglioni

refactoring and fixed issue for empty project list

54093 03/12/2018 05:43 PM Miriam Baglioni

Added new check for result type checking inside list of results in country propagation

54085 03/12/2018 05:01 PM Miriam Baglioni

fixed issue for empty result in the list of results to which the country is propagated (last fix produced a bug)

54084 03/12/2018 05:00 PM Miriam Baglioni

changed message in context

54072 03/12/2018 11:39 AM Miriam Baglioni

fixed bug for empty list of results (50|)

54070 03/12/2018 11:11 AM Miriam Baglioni

Propagation general Iterator and Constants class

54069 03/12/2018 11:03 AM Miriam Baglioni

Iterator to handle results from mapper for project propagation

54068 03/12/2018 11:02 AM Miriam Baglioni

Reducers for project propagation to File and to HBase

54067 03/12/2018 11:01 AM Miriam Baglioni

Iterator for managing values from mapper for result country propagation

54064 29/11/2018 05:59 PM Miriam Baglioni

fixed issue

54045 29/11/2018 12:29 PM Miriam Baglioni
54040 29/11/2018 12:07 PM Miriam Baglioni

new Exception class for not valid list of values in reducer

54039 29/11/2018 12:06 PM Miriam Baglioni

modified type for type variable. From int to Type

54038 29/11/2018 12:05 PM Miriam Baglioni

added new variables and methods

54037 29/11/2018 12:04 PM Miriam Baglioni

updated implementation to use iterator

54036 29/11/2018 12:01 PM Miriam Baglioni

specific iterator for country propagation

54035 29/11/2018 12:01 PM Miriam Baglioni

Added generic Iterator over the list of information gathered from the reducer

54012 28/11/2018 02:24 PM Sandro La Bruzzo

fix error on parsing abstract

54010 28/11/2018 10:59 AM Miriam Baglioni

updated to consider the new class Utils and the new values in Propagation Constants classes

54009 28/11/2018 10:55 AM Miriam Baglioni

updated to store properties of the value

54008 28/11/2018 10:54 AM Miriam Baglioni

class to contain the methods common to the propagation classes among the propagation types

54007 28/11/2018 10:53 AM Miriam Baglioni

update of the constants for the propagation

54006 28/11/2018 10:51 AM Miriam Baglioni

map-reduce to implement the propagation of the association between project and result through the existence of semantic relations

54005 28/11/2018 10:48 AM Miriam Baglioni

map to count the number of results having a semantic relation of type isSupplementedBy that also have a relation isProducedBy with at least one project

53983 27/11/2018 12:06 PM Claudio Atzori

added test to verify the parsing of EPMC record

53779 15/11/2018 12:25 PM Claudio Atzori

avoid importing unnecessary affiliations from DOIBoost

53756 14/11/2018 05:13 PM Alessia Bardi

Testing software from biotools

53737 13/11/2018 05:21 PM Sandro La Bruzzo

fixed bug for uncompressed abstract in DOIBoostToAction

53736 13/11/2018 04:55 PM Sandro La Bruzzo

changed mapping for compressed abstract in DOIBoostToAction

53726 13/11/2018 09:08 AM Claudio Atzori

less verbose logging

53720 12/11/2018 03:54 PM Claudio Atzori

using proper import for LogFactory

53710 09/11/2018 07:01 PM Claudio Atzori

added Mapper and Reducer class for infoSpace counts workflows

53687 09/11/2018 02:17 PM Claudio Atzori

added reducer to produce counts on the infospace

53684 09/11/2018 12:28 PM Claudio Atzori

added reducer to produce counts on the infospace

53628 05/11/2018 04:41 PM Claudio Atzori

cannot use Guava's Splitter.splitToList, must stick to basic split method. Classpath is messed up

53604 31/10/2018 11:34 AM Claudio Atzori

implemented ConfigurableExportMapper

53602 31/10/2018 10:36 AM Sandro La Bruzzo

Removed un-used import

53592 30/10/2018 12:36 PM Sandro La Bruzzo

fixed test

53591 30/10/2018 12:30 PM Sandro La Bruzzo

updated Mapper to return the whole invalid record

53589 30/10/2018 12:27 PM Claudio Atzori

export invalid xml records

53588 30/10/2018 12:26 PM Sandro La Bruzzo

refactored Action

53572 29/10/2018 10:12 AM Claudio Atzori

Map only job that produces [openaireId, doi] pairs of records containing invalid characters

53571 29/10/2018 10:10 AM Claudio Atzori

reduced file size

53565 26/10/2018 03:40 PM Sandro La Bruzzo

added parameter to filter only organization in DOIBoostToAction

53556 25/10/2018 04:30 PM Sandro La Bruzzo

fixed problem of missing name in authors

53554 25/10/2018 12:26 PM Sandro La Bruzzo

merged beta branch to master

53518 18/10/2018 02:48 PM Claudio Atzori

introduced use of BlockProcessor

53461 15/10/2018 12:57 PM Claudio Atzori

updated test for hbase mapping for organizations

53421 09/10/2018 11:55 AM Miriam Baglioni

fixed issue when country information is not present for datasource

53419 08/10/2018 10:14 AM Miriam Baglioni

replaced throwing of exceptions with counters

53410 05/10/2018 04:18 PM Miriam Baglioni

change parameter from ImmutableBytesWritable to Text

53409 05/10/2018 04:17 PM Miriam Baglioni

refactoring and change of counters

53408 05/10/2018 04:09 PM Claudio Atzori

using updated mapping-utils module, added unit test to check the merge procedure for context and country updates

53407 05/10/2018 04:06 PM Claudio Atzori

rollback wrong commit

53386 04/10/2018 03:35 PM Claudio Atzori

fixing and testing propagation implementation

53383 04/10/2018 02:46 PM Miriam Baglioni

reducer for country propagation that writes on hdfs

53380 03/10/2018 04:19 PM Miriam Baglioni
53371 03/10/2018 10:22 AM Claudio Atzori

cleanup pid types in order to make them valid attributes

53369 02/10/2018 02:08 PM Miriam Baglioni
53364 02/10/2018 11:32 AM Alessia Bardi

Need to set resourceTypeGeneral to clinicalTrial as this is where the IIS can distinguish clinical trial records from "normal dataset"

53362 02/10/2018 10:18 AM Miriam Baglioni

added code for propagation of countries from institutional organization

53340 01/10/2018 10:04 AM Claudio Atzori

master branch for deployments @ICM

53336 01/10/2018 09:26 AM Claudio Atzori

why parse strings as Floats?

53288 27/09/2018 01:48 PM Claudio Atzori

reverted to r52985. Test runs show we need to rely on the edgeIds produced by the connected components identification phase instead of the vertexIds

53280 26/09/2018 03:59 PM Miriam Baglioni

changes to account for code modifications made to align with the trunk version

53279 26/09/2018 03:59 PM Miriam Baglioni

alignment to trunk version

53262 25/09/2018 05:41 PM Claudio Atzori

avoid producing duplicate events by eliminating the roots from the comparison process

53261 25/09/2018 04:55 PM Claudio Atzori

broker event serialization test for ProjectEventFactory

53260 25/09/2018 03:26 PM Claudio Atzori

introduced mapping resulttype -> portal url

53213 21/09/2018 12:36 PM Alessia Bardi

Fixed log class name

53191 19/09/2018 05:46 PM Claudio Atzori

avoid collisions when hashing pids by value

53190 19/09/2018 05:33 PM Claudio Atzori

cleaned up unused method, using setDurability in put operation

53119 13/09/2018 06:14 PM Claudio Atzori

updated opentrial input record used in test, fixes #3886

53103 12/09/2018 03:03 PM Claudio Atzori

added mapper and hadoop job configuration file for importing Grid.AC organization data

53080 11/09/2018 06:32 PM Claudio Atzori

integrating bulktag from trunk to beta branch

53079 11/09/2018 06:31 PM Claudio Atzori

integrating bulktag from trunk to beta branch

53078 11/09/2018 06:31 PM Claudio Atzori

integrating bulktag from trunk to beta branch

53068 11/09/2018 03:27 PM Claudio Atzori

rule out invalid dates also on CrossRefToActions

53067 11/09/2018 03:22 PM Claudio Atzori

rule out invalid dates on ScholixToActions

53036 10/09/2018 10:17 AM Claudio Atzori

cleanup

53035 10/09/2018 10:16 AM Claudio Atzori

produce 'supplement' subrel type in case of supplement relationships

53025 05/09/2018 02:33 PM Claudio Atzori

simplified connected component application on the graph

52993 28/08/2018 05:06 PM Sandro La Bruzzo

added check to investigate the bug that generates wrong relations

52985 27/08/2018 10:07 AM Claudio Atzori

do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer

52984 27/08/2018 10:00 AM Claudio Atzori

do not push vertex ids in memory, process them on the fly

52980 23/08/2018 10:54 AM Alessia Bardi

fixed name of TDS profile used by test method

52960 08/08/2018 12:36 PM Claudio Atzori

added jobs for predatory journal analysis