  • svn:ignore: .classpath .settings target .project *.iml upload_*

# Date Author Comment
53715 12/11/2018 11:51 AM Claudio Atzori

using most recent dnet-openaireplus-mapping utils version (6.2.18)

53710 09/11/2018 07:01 PM Claudio Atzori

added Mapper and Reducer class for infoSpace counts workflows

53687 09/11/2018 02:17 PM Claudio Atzori

added reducer to produce counts on the infospace

53684 09/11/2018 12:28 PM Claudio Atzori

added reducer to produce counts on the infospace

53628 05/11/2018 04:41 PM Claudio Atzori

cannot use Guava's Splitter.splitToList, must stick to basic split method. Classpath is messed up

53604 31/10/2018 11:34 AM Claudio Atzori

implemented ConfigurableExportMapper

53602 31/10/2018 10:36 AM Sandro La Bruzzo

Removed unused import

53592 30/10/2018 12:36 PM Sandro La Bruzzo

fixed test

53591 30/10/2018 12:30 PM Sandro La Bruzzo

updated Mapper to return the whole invalid record

53589 30/10/2018 12:27 PM Claudio Atzori

export invalid xml records

53588 30/10/2018 12:26 PM Sandro La Bruzzo

refactored Action

53572 29/10/2018 10:12 AM Claudio Atzori

Map only job that produces [openaireId, doi] pairs of records containing invalid characters

53571 29/10/2018 10:10 AM Claudio Atzori

reduced file size

53565 26/10/2018 03:40 PM Sandro La Bruzzo

added parameter to filter only organization in DOIBoostToAction

53556 25/10/2018 04:30 PM Sandro La Bruzzo

fixed problem of missing name in authors

53554 25/10/2018 12:26 PM Sandro La Bruzzo

merged beta branch to master

53518 18/10/2018 02:48 PM Claudio Atzori

introduced use of BlockProcessor

53467 15/10/2018 03:23 PM Claudio Atzori

[maven-release-plugin] prepare for next development iteration

53465 15/10/2018 03:23 PM Claudio Atzori

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.8-MASTER

53461 15/10/2018 12:57 PM Claudio Atzori

updated test for hbase mapping for organizations

53437 10/10/2018 06:43 PM Claudio Atzori

bumped version, dnet-openaireplus-mapping-utils:6.2.15 should fix the unmapped instancetype terms

53436 10/10/2018 06:39 PM Claudio Atzori

updated dnet-openaireplus-mapping-utils dependency version

53421 09/10/2018 11:55 AM Miriam Baglioni

fixed issue when country information is not present for datasource

53419 08/10/2018 10:14 AM Miriam Baglioni

replaced exception throwing with counters

53418 06/10/2018 07:49 PM Claudio Atzori

using updated mapping-utils module

53410 05/10/2018 04:18 PM Miriam Baglioni

change parameter from ImmutableBytesWritable to Text

53409 05/10/2018 04:17 PM Miriam Baglioni

refactoring and change of counters

53408 05/10/2018 04:09 PM Claudio Atzori

using updated mapping-utils module, added unit test to check the merge procedure for context and country updates

53407 05/10/2018 04:06 PM Claudio Atzori

rollback wrong commit

53386 04/10/2018 03:35 PM Claudio Atzori

fixing and testing propagation implementation

53383 04/10/2018 02:46 PM Miriam Baglioni

reducer for country propagation that writes on hdfs

53380 03/10/2018 04:19 PM Miriam Baglioni

53371 03/10/2018 10:22 AM Claudio Atzori

cleanup pid types in order to make them valid attributes

53369 02/10/2018 02:08 PM Miriam Baglioni

53364 02/10/2018 11:32 AM Alessia Bardi

Need to set resourceTypeGeneral to clinicalTrial, as this is what lets the IIS distinguish clinical trial records from "normal" datasets

53362 02/10/2018 10:18 AM Miriam Baglioni

added code for propagation of countries from institutional organization

53345 01/10/2018 10:12 AM Claudio Atzori

[maven-release-plugin] prepare for next development iteration

53343 01/10/2018 10:12 AM Claudio Atzori

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.6-MASTER

53341 01/10/2018 10:05 AM Claudio Atzori

updated pom, master branch

53340 01/10/2018 10:04 AM Claudio Atzori

master branch for deployments @ICM

53336 01/10/2018 09:26 AM Claudio Atzori

why parse strings as Floats?

53288 27/09/2018 01:48 PM Claudio Atzori

reverted to r52985. Test runs show we need to rely on the edgeIds produced by the connected components identification phase instead of the vertexIds

53280 26/09/2018 03:59 PM Miriam Baglioni

changes to account for code modifications made to align with the trunk version

53279 26/09/2018 03:59 PM Miriam Baglioni

alignment to trunk version

53262 25/09/2018 05:41 PM Claudio Atzori

avoid producing duplicate events by excluding the roots from the comparison process

53261 25/09/2018 04:55 PM Claudio Atzori

broker event serialization test for ProjectEventFactory

53260 25/09/2018 03:26 PM Claudio Atzori

introduced mapping resulttype -> portal url

53216 21/09/2018 12:42 PM Alessia Bardi

[maven-release-plugin] prepare for next development iteration

53214 21/09/2018 12:42 PM Alessia Bardi

[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.5-BETA

53213 21/09/2018 12:36 PM Alessia Bardi

Fixed log class name

53191 19/09/2018 05:46 PM Claudio Atzori

avoid collisions when hashing pids by value

53190 19/09/2018 05:33 PM Claudio Atzori

cleaned up unused method, using setDurability in put operation

53119 13/09/2018 06:14 PM Claudio Atzori

updated opentrial input record used in test, fixes #3886

53103 12/09/2018 03:03 PM Claudio Atzori

added mapper and hadoop job configuration file for importing Grid.AC organization data

53088 11/09/2018 06:52 PM Claudio Atzori

using mapping-utils version 6.2.11

53080 11/09/2018 06:32 PM Claudio Atzori

integrating bulktag from trunk to beta branch

53079 11/09/2018 06:31 PM Claudio Atzori

integrating bulktag from trunk to beta branch

53078 11/09/2018 06:31 PM Claudio Atzori

integrating bulktag from trunk to beta branch

53068 11/09/2018 03:27 PM Claudio Atzori

rule out invalid dates also on CrossRefToActions

53067 11/09/2018 03:22 PM Claudio Atzori

rule out invalid dates on ScholixToActions

53036 10/09/2018 10:17 AM Claudio Atzori

cleanup

53035 10/09/2018 10:16 AM Claudio Atzori

produce 'supplement' subrel type in case of supplement relationships

53025 05/09/2018 02:33 PM Claudio Atzori

simplified connected component application on the graph

53014 03/09/2018 02:56 PM Claudio Atzori

updated dependencies: dnet-pace-core

52993 28/08/2018 05:06 PM Sandro La Bruzzo

added check to help diagnose the bug that generates wrong relations

52985 27/08/2018 10:07 AM Claudio Atzori

do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer

52984 27/08/2018 10:00 AM Claudio Atzori

do not push vertex ids in memory, process them on the fly

52980 23/08/2018 10:54 AM Alessia Bardi

fixed name of TDS profile used by test method

52960 08/08/2018 12:36 PM Claudio Atzori

added jobs for predatory journal analysis

52959 07/08/2018 06:17 PM Sandro La Bruzzo

removed warning

52958 07/08/2018 06:15 PM Sandro La Bruzzo

added invisible setup

52957 07/08/2018 06:12 PM Sandro La Bruzzo

refactored Action

52956 07/08/2018 06:07 PM Sandro La Bruzzo

fixed null element

52955 07/08/2018 05:51 PM Sandro La Bruzzo

Created CrossrefImportMapper

52951 07/08/2018 05:30 PM Sandro La Bruzzo

add CrossRefToAction

52949 07/08/2018 03:17 PM Claudio Atzori

bumped dependency version

52935 07/08/2018 11:29 AM Claudio Atzori

fixed mapping from scholix to openaire model

52931 07/08/2018 09:39 AM Claudio Atzori

small fixes

52930 06/08/2018 06:35 PM Sandro La Bruzzo

changed key type

52929 06/08/2018 06:29 PM Sandro La Bruzzo

changed key type

52916 06/08/2018 05:32 PM Sandro La Bruzzo

implemented mapper writing

52915 06/08/2018 04:52 PM Sandro La Bruzzo

added configuration

52912 06/08/2018 04:09 PM Sandro La Bruzzo

added Mapper to transform scholexplorer links into actionsets

52883 02/08/2018 04:25 PM Claudio Atzori

deprecation: use setDurability instead of setWriteToWAL

52878 02/08/2018 02:19 PM Claudio Atzori

introduced subType in pace wf configuration

52823 25/07/2018 04:10 PM Claudio Atzori

adjusted ids export procedure

52805 24/07/2018 05:22 PM Claudio Atzori

avoid emitting enrichment events when the similarity score is below the threshold

52804 24/07/2018 02:56 PM Claudio Atzori

avoid emitting enrichment events when the similarity score is below the threshold

52803 24/07/2018 02:53 PM Claudio Atzori

avoid emitting enrichment events when the similarity score is below the threshold

52802 24/07/2018 02:04 PM Claudio Atzori

javadoc and test

52801 24/07/2018 12:14 PM Claudio Atzori

indentation

52797 23/07/2018 04:10 PM Claudio Atzori

pick the 1st instance to avoid collisions

52777 20/07/2018 04:04 PM Claudio Atzori

improved behaviour of EventWrapperTest

52776 20/07/2018 03:18 PM Michele Artini

52775 20/07/2018 03:07 PM Michele Artini

Partial implementation of a unit test

52773 20/07/2018 11:38 AM Michele Artini

Partial implementation of a unit test

52765 18/07/2018 11:45 AM Michele Artini

Fixed the generation of eventIds

52751 13/07/2018 05:38 PM Alessia Bardi

Workaround for CLARIN mining issue: #3670#note-29

52716 09/07/2018 12:59 PM Claudio Atzori

depending on dnet-openaireplus-mapping-utils:6.2.8

52710 06/07/2018 04:11 PM Claudio Atzori

depending on dnet-openaireplus-mapping-utils:6.2.7