Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
ConnectedComponentsMapper.java 479 Bytes over 8 years claudio.atzori
ConnectedComponentsReducer.java 3.06 KB 56214 almost 5 years Claudio Atzori print the dedup config string before parsing it
HBaseToSimilarityGraphMapper.java 1.04 KB over 8 years claudio.atzori
MindistSearchMapper.java 1.38 KB 49218 over 6 years Claudio Atzori skip weird cases in CC algo
MindistSearchReducer.java 2.81 KB 49218 over 6 years Claudio Atzori skip weird cases in CC algo
VertexWritable.java 3.91 KB 49218 over 6 years Claudio Atzori skip weird cases in CC algo

Latest revisions

# Date Author Comment
56214 27/06/2019 11:01 AM Claudio Atzori

print the dedup config string before parsing it

53800 15/11/2018 05:38 PM Claudio Atzori

import form master branch

53340 01/10/2018 10:04 AM Claudio Atzori

master branch for deployments @ICM

53288 27/09/2018 01:48 PM Claudio Atzori

reverted to r52985 . Test runs shows we need to rely on the edgeIds produced by the connected components identfication phase instead of the vertexIds

53025 05/09/2018 02:33 PM Claudio Atzori

simplified connected component application on the graph

52985 27/08/2018 10:07 AM Claudio Atzori

do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer

52984 27/08/2018 10:00 AM Claudio Atzori

do not push vertex ids in memory, process them on the fly

50236 03/01/2018 09:22 AM Claudio Atzori

beta

49218 03/10/2017 06:14 PM Claudio Atzori

skip weird cases in CC algo

45318 11/01/2017 03:59 PM Claudio Atzori

codebase used to migrate to java8 the production system

View revisions

Also available in: Atom