# Date Author Comment
42623 23/05/2016 03:05 PM Alessia Bardi

updated XML record file used for opentrials testing.

42596 20/05/2016 11:58 AM Claudio Atzori

reverted to use json-java-format 1.2

42591 20/05/2016 11:50 AM Claudio Atzori

reverted to previous revision

42590 20/05/2016 11:47 AM Claudio Atzori

changed WRITE_TO_WAL = true for all jobs writing to HBase tables

42584 19/05/2016 11:20 AM Claudio Atzori

added counters to keep track of the relationships provenance

42544 13/05/2016 11:20 AM Alessia Bardi

[maven-release-plugin] prepare for next development iteration

42543 13/05/2016 11:20 AM Alessia Bardi

[maven-release-plugin] copy for tag dnet-mapreduce-jobs-0.0.8.8

42542 13/05/2016 11:20 AM Alessia Bardi

[maven-release-plugin] prepare release dnet-mapreduce-jobs-0.0.8.8

42534 13/05/2016 10:50 AM Alessia Bardi

new test for claim updates

42509 11/05/2016 06:46 PM Alessia Bardi

updated opentrial sample record

42501 11/05/2016 04:49 PM Claudio Atzori

excluding dateoftransformation from metadata fields, it should be serialised only in the record header

42499 11/05/2016 03:30 PM Alessia Bardi

Added dr:dateOfTransformation to some test XML files.
For publications dr:dateOfCollection must be set.
For datasets dri:dateOfCollections must be set.

42495 10/05/2016 07:17 PM Alessia Bardi

Testing OpenTrials dataset record mapping. Depending on snapshot parent.

42423 03/05/2016 10:39 AM Claudio Atzori

finally I made those scripts decent

42417 02/05/2016 04:39 PM Claudio Atzori

[maven-release-plugin] prepare for next development iteration

42416 02/05/2016 04:39 PM Claudio Atzori

[maven-release-plugin] copy for tag dnet-mapreduce-jobs-0.0.8.7

42415 02/05/2016 04:39 PM Claudio Atzori

[maven-release-plugin] prepare release dnet-mapreduce-jobs-0.0.8.7

42414 02/05/2016 04:32 PM Claudio Atzori

fixed dependencies, depending on released parent

42392 02/05/2016 03:21 PM Claudio Atzori

import cleanup

42391 02/05/2016 03:19 PM Claudio Atzori

reverting, we need less getters

42388 02/05/2016 03:03 PM Claudio Atzori

ignores

42387 02/05/2016 03:01 PM Claudio Atzori

experiments for scoreResult

42386 02/05/2016 02:55 PM Claudio Atzori

score result

42385 02/05/2016 02:54 PM Claudio Atzori

upload

42384 02/05/2016 02:53 PM Claudio Atzori

tests for dedup experiments

42383 02/05/2016 02:48 PM Claudio Atzori

added more getters

42382 02/05/2016 02:47 PM Claudio Atzori

dedup experiments

42381 02/05/2016 02:47 PM Claudio Atzori

added mapper class for hdfs actions

42362 02/05/2016 12:13 PM Claudio Atzori

cleanup

42333 27/04/2016 04:45 PM Alessia Bardi

depending on released mongo-logging

42247 15/04/2016 06:06 PM Claudio Atzori

added Mapper class PromoteActionSetFromHDFS

42224 13/04/2016 05:52 PM Alessia Bardi

Updated pom version to 0.0.8.7

41764 18/03/2016 06:13 PM Claudio Atzori

added anchorStats map-only job

41681 15/03/2016 12:48 PM Claudio Atzori

added counter for DOIs

41649 10/03/2016 03:55 PM Claudio Atzori

removing useless counters

41648 10/03/2016 03:54 PM Claudio Atzori

using most recent dnet-pace-core features

41647 10/03/2016 03:54 PM Claudio Atzori

fixed DedupDeleteRelMapper

41646 10/03/2016 03:53 PM Claudio Atzori

do not export deleted entities

41645 10/03/2016 03:52 PM Claudio Atzori

adapted to the removal of contributors as relationships

41519 02/03/2016 06:42 PM Claudio Atzori

updated scripts

41518 02/03/2016 06:41 PM Claudio Atzori

bumped version

41517 02/03/2016 06:37 PM Claudio Atzori

added utility methods to deal with strings rather than byte[]

41516 02/03/2016 06:31 PM Claudio Atzori

sort merged ids

41515 02/03/2016 06:30 PM Claudio Atzori

log the documents being compared before failing

41477 29/02/2016 03:44 PM Claudio Atzori

test for ARC

41468 29/02/2016 03:22 PM Claudio Atzori

introducing support for projects that don't provide a link to a specific fundingpath.

41070 27/01/2016 06:59 PM Claudio Atzori

implemented job and workflow to export the openaire identifiers

41055 27/01/2016 12:05 PM Claudio Atzori

log the number of items clustered on each key

41054 27/01/2016 11:59 AM Claudio Atzori

do not consider deleted entities

40341 11/12/2015 10:19 AM Alessia Bardi

New test for openaire2.0_data compliance for datasets

40332 10/12/2015 06:30 PM Claudio Atzori

bumped version

40331 10/12/2015 06:28 PM Claudio Atzori

updating to dnet-openaire-data-protos:3.5.0

40314 09/12/2015 06:13 PM Claudio Atzori

updated to dnet-openaire-data-protos:3.5.0-SNAPSHOT

40205 02/12/2015 06:13 PM Claudio Atzori

cleanup, extended tests to include new relationships and mapping profiles

40129 27/11/2015 04:19 PM Claudio Atzori

counters

40126 27/11/2015 03:30 PM Claudio Atzori

counter test

40125 27/11/2015 03:29 PM Claudio Atzori

depending on version range

40107 25/11/2015 06:48 PM Claudio Atzori

testing author dedup

40106 25/11/2015 06:47 PM Claudio Atzori

branch offline dedup

40105 25/11/2015 06:46 PM Claudio Atzori

cleanup

40063 20/11/2015 05:51 PM Alessia Bardi

Tests load the XSLT from the TDSRule profiles in dnet-openaireplus-profiles

40039 20/11/2015 03:43 PM Alessia Bardi

Back to revision r39888 and updated pom and sh files

40034 20/11/2015 03:41 PM Claudio Atzori

depending on SNAPSHOT parent

40029 20/11/2015 03:37 PM Claudio Atzori

playing with index feeding

40028 20/11/2015 03:36 PM Claudio Atzori

removed

40027 20/11/2015 03:36 PM Claudio Atzori

playing with index feeding

40026 20/11/2015 03:35 PM Claudio Atzori

removing old branch

40024 20/11/2015 03:31 PM Alessia Bardi

bumped minor version

39973 17/11/2015 05:52 PM Claudio Atzori

make SNAPSHOTs visible to this module

39972 17/11/2015 05:40 PM Claudio Atzori

added possibility to post-process the result stored in the index documents

39888 12/11/2015 04:00 PM Alessia Bardi

ticket #1588 Rename "native" compatibility to "proprietary"

39623 19/10/2015 11:35 AM Michele Artini

use of external properties

39616 16/10/2015 05:21 PM Claudio Atzori

added min distance algorithm, used to identify the connected components (dedup)

39615 16/10/2015 05:20 PM Claudio Atzori

bumped version

39614 16/10/2015 05:19 PM Claudio Atzori

bumped version

39605 16/10/2015 04:34 PM Michele Artini

limit the job to institutional pubsrepository

39584 16/10/2015 09:43 AM Michele Artini

counter labels

39567 14/10/2015 11:58 AM Michele Artini

use of Text instead of ImmutableBytesWritable

39562 13/10/2015 04:56 PM Michele Artini

reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)

39524 09/10/2015 12:18 PM Claudio Atzori

reuse the same outkey and outvalue objects

39431 01/10/2015 11:07 AM Claudio Atzori

added more mapping tests, using xslt picked from services.openaire

39297 18/09/2015 04:18 PM Claudio Atzori

spring makes me lazy

39290 18/09/2015 03:07 PM Claudio Atzori

added infospace dump mapper

39275 18/09/2015 09:19 AM Claudio Atzori

39222 14/09/2015 06:06 PM Claudio Atzori

added information space export job

38951 02/09/2015 12:58 PM Alessia Bardi

testing umlauts

38950 02/09/2015 12:58 PM Alessia Bardi

testing umlauts

38835 28/08/2015 10:24 AM Claudio Atzori

cleanup

38834 28/08/2015 10:24 AM Claudio Atzori

updated to the new mongodb driver specs

38692 21/08/2015 12:42 PM Alessia Bardi

Null values for FP7 and H2020 specific fields about OA mandate and Data Pilot.

38672 05/08/2015 06:15 PM Alessia Bardi

Generate compressed records in the OAI store.

38671 05/08/2015 06:13 PM Alessia Bardi

Do not check the status of a record: we assume we have to insert it because the OAI store is built in refresh mode.

38665 04/08/2015 05:33 PM Alessia Bardi

OAIStore with compressed bodies. Currently for beta only.

38586 29/07/2015 05:23 PM Claudio Atzori

fixed tests, added new dedup specific jobs

38374 20/07/2015 04:59 PM Claudio Atzori

added implementors for offline dedup person workflow

38324 17/07/2015 03:02 PM Claudio Atzori

cleanup

38322 17/07/2015 11:54 AM Claudio Atzori

cleanup