Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  cc 49218 over 6 years Claudio Atzori skip weird cases in CC algo
  experiment 49029 over 6 years Claudio Atzori getting rid of person entities
  gt 49029 over 6 years Claudio Atzori getting rid of person entities
DedupBuildRootsMapper.java 5.04 KB 53176 over 5 years Claudio Atzori DedupBuildRoots[mapper|reducer] merged implemen...
DedupBuildRootsReducer.java 8.04 KB 53176 over 5 years Claudio Atzori DedupBuildRoots[mapper|reducer] merged implemen...
DedupDeleteRelMapper.java 2.11 KB almost 8 years claudio.atzori
DedupDeleteSimRelMapper.java 1.92 KB over 8 years claudio.atzori
DedupFindRootsMapper.java 4.73 KB almost 8 years claudio.atzori
DedupGrouperMapper.java 1.95 KB almost 8 years claudio.atzori
DedupMapper.java 4.32 KB 52877 over 5 years Claudio Atzori introduced subType in pace wf configuration
DedupMarkDeletedEntityMapper.java 3.58 KB almost 8 years claudio.atzori
DedupPersonBean.java 1.06 KB over 10 years michele.artini
DedupReducer.java 8.44 KB 52852 over 5 years Claudio Atzori deprecation: use setDurability instead of setWr...
DedupRootsToCsvMapper.java 2.61 KB almost 9 years claudio.atzori
DedupRootsToCsvReducer.java 2.56 KB almost 9 years claudio.atzori
DedupSimilarityToHdfsActionsMapper.java 3.07 KB over 7 years claudio.atzori
FindDedupCandidatePersonsReducer.java 3.96 KB almost 8 years claudio.atzori
RootEntity.java 799 Bytes almost 9 years claudio.atzori
SimpleDedupPersonReducer.java 5.76 KB over 7 years alessia.bardi

Latest revisions

# Date Author Comment
53036 10/09/2018 10:17 AM Claudio Atzori

cleanup

53025 05/09/2018 02:33 PM Claudio Atzori

simplified connected component application on the graph

52993 28/08/2018 05:06 PM Sandro La Bruzzo

adding check to understand the bug of wrong relation generated

52985 27/08/2018 10:07 AM Claudio Atzori

do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer

52984 27/08/2018 10:00 AM Claudio Atzori

do not push vertex ids in memory, process them on the fly

52883 02/08/2018 04:25 PM Claudio Atzori

deprecation: use setDurability instead of setWriteToWAL

52878 02/08/2018 02:19 PM Claudio Atzori

introduced subType in pace wf configuration

50270 10/01/2018 05:49 PM Claudio Atzori

getting rid of ugly hacks

50236 03/01/2018 09:22 AM Claudio Atzori

beta

49517 17/10/2017 03:08 PM Claudio Atzori

exclude from the deduplication process results that aren't publications

View revisions

Also available in: Atom