dnet45dhp-schemasdnet-hadoopdnet40dnet50
backported PidClustering from dnet50
reverted to r42370
updated to commons.lang3
introducing structured document distance response
cleanup
avoid early optimizations
configuration refactor, implemented PidMatch condition
id field can be null, equals and hashcode depend also on the fullname
introduced condition: mustBeDifferent
introduced clustering function: sortedNgramPairsdistance algo: MustBeDifferent
View revisions
Also available in: Atom