# Date Author Comment
49149 28/09/2017 10:17 AM Claudio Atzori

do not fail when the vertex does not contain edges

44670 28/11/2016 10:35 AM Claudio Atzori

avoid failing with an NPE in case of missing relClass(es)

44467 14/11/2016 12:24 PM Claudio Atzori

refactor: easier way to build dedup rels

44130 17/10/2016 04:34 PM Claudio Atzori

align with latest dnet-actionmanager-common

43534 31/08/2016 04:06 PM Alessia Bardi

commented out code that sets a counter, because we reach the maximum number of counters and the job fails

42590 20/05/2016 11:47 AM Claudio Atzori

changed WRITE_TO_WAL = true for all jobs writing to HBase tables

42382 02/05/2016 02:47 PM Claudio Atzori

dedup experiments

42381 02/05/2016 02:47 PM Claudio Atzori

added mapper class for hdfs actions

41764 18/03/2016 06:13 PM Claudio Atzori

added anchorStats map-only job

41649 10/03/2016 03:55 PM Claudio Atzori

removing useless counters

41648 10/03/2016 03:54 PM Claudio Atzori

using most recent dnet-pace-core features

41647 10/03/2016 03:54 PM Claudio Atzori

fixed DedupDeleteRelMapper

41646 10/03/2016 03:53 PM Claudio Atzori

do not export deleted entities

41517 02/03/2016 06:37 PM Claudio Atzori

added utility methods to deal with strings rather than byte[]

41515 02/03/2016 06:30 PM Claudio Atzori

log the documents being compared before failing

41055 27/01/2016 12:05 PM Claudio Atzori

log the number of items clustered on each key

41054 27/01/2016 11:59 AM Claudio Atzori

do not consider deleted entities

40314 09/12/2015 06:13 PM Claudio Atzori

updated to dnet-openaire-data-protos:3.5.0-SNAPSHOT

40205 02/12/2015 06:13 PM Claudio Atzori

cleanup, extended tests to include new relationships and mapping profiles

39616 16/10/2015 05:21 PM Claudio Atzori

added min distance algorithm, used to identify the connected components (dedup)

38586 29/07/2015 05:23 PM Claudio Atzori

fixed tests, added new dedup specific jobs

38374 20/07/2015 04:59 PM Claudio Atzori

added implementors for offline dedup person workflow