# Date Author Comment
46128 03/03/2017 03:30 PM Claudio Atzori

backported PidClustering from dnet50

43280 19/07/2016 02:37 PM Claudio Atzori

reverted to r42370

42968 20/06/2016 04:09 PM Claudio Atzori

updated to commons.lang3

41642 10/03/2016 03:31 PM Claudio Atzori

introducing structured document distance response

41581 08/03/2016 10:27 AM Claudio Atzori

cleanup

41580 08/03/2016 10:26 AM Claudio Atzori

avoid early optimizations

41504 02/03/2016 06:23 PM Claudio Atzori

configuration refactor, implemented PidMatch condition

41208 08/02/2016 12:30 PM Claudio Atzori

id field can be null, equals and hashcode depend also on the fullname

41138 02/02/2016 04:39 PM Claudio Atzori

introduced condition: mustBeDifferent

41137 02/02/2016 04:30 PM Claudio Atzori

introduced clustering function: sortedNgramPairs; distance algo: MustBeDifferent

40910 20/01/2016 02:19 PM Claudio Atzori

introduced originalId in GT model, added PersonDistance function

40081 24/11/2015 03:56 PM Claudio Atzori

added case insensitive condition

39087 08/09/2015 02:42 PM Claudio Atzori

added limit for the number of tokens generated from a person fullname

38602 31/07/2015 11:15 AM Claudio Atzori

a list field is empty when all its members are empty

38600 31/07/2015 10:46 AM Claudio Atzori

PersonDistance refinements

38523 27/07/2015 06:08 PM Claudio Atzori

added lastname constraint in person distance (JaroWinkler)

38423 23/07/2015 01:59 PM Claudio Atzori

added configuration for PersonDistance algo

38362 20/07/2015 02:46 PM Claudio Atzori

utility methods and avoid NPEs

38167 13/07/2015 09:41 AM Claudio Atzori

added support for Person objects, model, clustering, distance

38059 06/07/2015 12:06 PM Claudio Atzori

MapDocument implements a more general view of the pace model

37507 27/05/2015 04:15 PM Claudio Atzori

added custom conditions in the proto path navigation

37502 27/05/2015 11:52 AM Claudio Atzori

removed unused method

37304 18/05/2015 06:21 PM Claudio Atzori

added AlwaysMatch distance function

37302 18/05/2015 06:13 PM Claudio Atzori

added person hash clustering function

37300 18/05/2015 06:01 PM Claudio Atzori

added exact match distance

37299 18/05/2015 06:00 PM Claudio Atzori

added ImmutableFieldValue clustering function

37298 18/05/2015 06:00 PM Claudio Atzori

added ImmutableFieldValue clustering function

37195 13/05/2015 02:14 PM Claudio Atzori

lower casing moved to a final cleanup stage, added more distance functions

36863 04/05/2015 09:33 AM Claudio Atzori

reduced default value

36799 28/04/2015 06:08 PM Claudio Atzori

do not alter the fullname

36613 22/04/2015 04:45 PM Claudio Atzori

configuration refactor: jsonization

36254 09/04/2015 03:41 PM Claudio Atzori

added max.children configuration param

36160 08/04/2015 10:36 AM Claudio Atzori

merged from branch configurationId

33135 02/12/2014 04:26 PM Claudio Atzori

merged branch ProtoMapping

33026 27/11/2014 02:13 PM Claudio Atzori

refactored normalization functions, updated cleanup behavior

29734 31/07/2014 02:28 PM Claudio Atzori

fixed blacklist type

29733 31/07/2014 12:49 PM Claudio Atzori

added toString

28981 10/07/2014 09:49 AM Claudio Atzori

extended dedup configuration, now including blacklists and algorithm parameters

28411 24/06/2014 09:59 AM Claudio Atzori

removed protocolbuffers dependency from dnet-pace-core; Builders and Proto-specific tests moved to dnet-openaireplus-mapping-utils; adapted dnet-mapreduce-jobs

28090 09/06/2014 03:18 PM Claudio Atzori

merged from branch 1.1.0