dnet40/modules/dnet-mapreduce-jobs/trunk/src/main/java/eu/dnetlib/data/mapreduce/hbase/dedup @ 49149
Name | Size | Revision | Age | Author | Comment |
---|---|---|---|---|---|
cc | 49149 | over 6 years | Claudio Atzori | do not fail when the vertex does not contian edges | |
experiment | almost 8 years | claudio.atzori | |||
gt | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... | |
preprocess | 38586 | over 8 years | Claudio Atzori | fixed tests, added new dedup specific jobs | |
DedupBuildRootsMapper.java | 5.97 KB | 41517 | about 8 years | Claudio Atzori | added utility methods to deal with strings rath... |
DedupBuildRootsReducer.java | 8.47 KB | 44670 | over 7 years | Claudio Atzori | avoiding to fail with NPE in case of missing re... |
DedupDeleteRelMapper.java | 2.11 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
DedupDeleteSimRelMapper.java | 1.92 KB | almost 9 years | claudio.atzori | ||
DedupFindRootsMapper.java | 4.73 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
DedupFindRootsPersonMapper.java | 3.72 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
DedupFindRootsPersonReducer.java | 4.96 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
DedupGrouperMapper.java | 1.95 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
DedupMapper.java | 3.7 KB | 41054 | over 8 years | Claudio Atzori | do not consider deleted entities |
DedupMarkDeletedEntityMapper.java | 3.58 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
DedupPersonBean.java | 1.06 KB | almost 11 years | michele.artini | ||
DedupReducer.java | 8.7 KB | 44467 | over 7 years | Claudio Atzori | refactor: easier way to build dedup rels |
DedupRootsToCsvMapper.java | 2.61 KB | 36796 | about 9 years | Claudio Atzori | csv export of the duplicates original ids |
DedupRootsToCsvReducer.java | 2.56 KB | 36796 | about 9 years | Claudio Atzori | csv export of the duplicates original ids |
DedupSimilarityToHdfsActionsMapper.java | 3.07 KB | 44130 | over 7 years | Claudio Atzori | align with latest dnet-actionmanager-common |
FindDedupCandidatePersonsMapper.java | 1.94 KB | 28308 | almost 10 years | Claudio Atzori | small refactor |
FindDedupCandidatePersonsReducer.java | 3.96 KB | 42590 | almost 8 years | Claudio Atzori | changed WRITE_TO_WAL = true for all jobs writin... |
FindPersonCoauthorsMapper.java | 2.44 KB | almost 9 years | claudio.atzori | ||
FindPersonCoauthorsReducer.java | 93 Bytes | almost 11 years | michele.artini | ||
RootEntity.java | 799 Bytes | 36164 | about 9 years | Claudio Atzori | added dedup roots to csv export job, dedup inde... |
SimpleDedupPersonMapper.java | 2.44 KB | 38059 | almost 9 years | Claudio Atzori | MapDocument implements a more general view of t... |
SimpleDedupPersonReducer.java | 5.76 KB | 43534 | over 7 years | Alessia Bardi | commented out code that sets a counter because ... |
Latest revisions
Also available in: Atom