Project

General

Profile

Statistics
| Revision:

# Date Author Comment
50352 20/01/2018 05:52 PM Eri Katsari

cleaning up mapper code

50350 20/01/2018 05:17 PM Eri Katsari

Adding property flag for dedup

49934 18/11/2017 10:19 PM Eri Katsari

Added inheritence in classes for mappers

49933 18/11/2017 04:22 PM Eri Katsari

removed limes depedency

49932 18/11/2017 04:11 PM Eri Katsari

Adding fields used for tokenizing as proprties

48846 25/08/2017 11:10 AM Eri Katsari

removed redis pass from code and updated getStats action to parse args correctly

48805 06/08/2017 11:29 PM Eri Katsari

Final update- fixed distance algs, added props un wf

48796 03/08/2017 01:43 AM Eri Katsari

Updating similarity comparators

48795 02/08/2017 09:25 PM Eri Katsari

Added support for composite keys, cleaned up code

48561 13/07/2017 01:44 AM Eri Katsari

Fixes on old code

48560 13/07/2017 01:25 AM Eri Katsari

Reverted changes to old code

48559 13/07/2017 12:13 AM Eri Katsari

Reduced max num of records

48558 12/07/2017 11:44 PM Eri Katsari

Downgraded to java 7

48557 12/07/2017 11:23 PM Eri Katsari

Added java 8 support; Updated blocking, fixed issue with token generation

48556 12/07/2017 08:46 PM Eri Katsari

Moved code for block creation inside blocking class

48471 09/07/2017 11:27 PM Eri Katsari

Testing with name only as blocking key

48468 09/07/2017 04:51 PM Eri Katsari

Fixed issue in comparison

48467 09/07/2017 04:49 PM Eri Katsari

Fixed issue in comparison

48466 09/07/2017 03:58 PM Eri Katsari

Added treemap in token blocking ; fixed issue with loop

48465 08/07/2017 07:50 PM Eri Katsari

Updated token blocking to use only specified fields and use composite keys

48228 03/07/2017 12:17 AM Eri Katsari

Changed token blocking to accept year as a token

48227 02/07/2017 11:42 PM Eri Katsari

Changed token blocking to accept year as a token

48226 02/07/2017 06:52 PM Eri Katsari

Changed token blocking to accept year as a token

48225 02/07/2017 10:25 AM Eri Katsari

cleaned up build

47687 26/06/2017 10:00 AM Eri Katsari

resolved conflicts to stable state

47686 25/06/2017 06:35 PM Eri Katsari

Added DatasetComparator class

47681 25/06/2017 06:02 PM Eri Katsari

adding new branches

47680 25/06/2017 06:01 PM Eri Katsari

Final updates and cleanup before benchmarking

47338 24/05/2017 02:08 AM Eri Katsari

Cleaning up code in Frequency counter

47337 24/05/2017 01:18 AM Eri Katsari

Cleaning up code in Frequency counter

46462 27/03/2017 08:45 PM Eri Katsari

Refactor redis utils. Cleaned up classses

46372 20/03/2017 08:47 PM Eri Katsari

Fixed issue with leftover Redis connections in Build mappers

45977 21/02/2017 11:52 PM Eri Katsari

A

45892 15/02/2017 01:08 AM Eri Katsari

A

45891 14/02/2017 09:39 PM Eri Katsari

fixed parsing

45890 14/02/2017 09:25 PM Eri Katsari

finished verification

45880 14/02/2017 12:29 PM Eri Katsari

Trimmed trailing \t in redis records.

45878 14/02/2017 11:53 AM Eri Katsari

Added M/R step for verification

45844 12/02/2017 06:17 PM Eri Katsari

Cleaned up packages, Added check in token blocking for only numeric tokens.

45843 12/02/2017 05:41 PM Eri Katsari

Refactored and cleaned up packages.

45842 12/02/2017 12:34 PM Eri Katsari

Trying storiing target records on map ini interlinking

45821 10/02/2017 08:28 AM Eri Katsari

fixed custom comparator

45809 08/02/2017 11:44 PM Eri Katsari

'updating

45808 08/02/2017 11:43 PM Eri Katsari

updating tests

45807 08/02/2017 11:33 PM Eri Katsari

'testing

45806 08/02/2017 11:30 PM Eri Katsari

'updates'

45800 08/02/2017 04:45 PM Eri Katsari

Updates for CR

45765 07/02/2017 02:20 PM Eri Katsari

FInal working build for linkage.

45764 07/02/2017 01:33 PM Eri Katsari

Fixed delims/parsing in DatasetComparator

45751 07/02/2017 10:34 AM Eri Katsari

removed <> from property names in reducer.

45748 07/02/2017 08:12 AM Eri Katsari

added recuder class

45746 07/02/2017 12:42 AM Eri Katsari

Added new m/r in build for testing lazy write on redcords

45716 05/02/2017 08:48 PM Eri Katsari

Added new m/r in build for testing lazy write on redcords

45713 05/02/2017 02:37 PM Eri Katsari

Added new m/r in build for testing lazy write on redcords

45639 31/01/2017 10:47 AM Eri Katsari

Cleaned up code for building

45638 31/01/2017 10:33 AM Eri Katsari

Cleaned up code for building ; fixed error in blocking

45637 31/01/2017 01:11 AM Eri Katsari

Refactored Build accoring to new parsing.

45636 30/01/2017 10:29 PM Eri Katsari

Cleaned up mappers /reducers;

45626 28/01/2017 12:22 PM Eri Katsari

Refactored Build accoring to new parsing.

45625 28/01/2017 10:06 AM Eri Katsari

Refactored Build accoring to new parsing.

45624 28/01/2017 01:09 AM Eri Katsari

Fnished target and source parsing ; Cleaning up build phase.

45622 27/01/2017 11:44 PM Eri Katsari

Fnished target and source parsing ; Cleaning up build phase.

45621 27/01/2017 10:58 PM Eri Katsari

Fnished target and source parsing

45531 23/01/2017 11:31 AM Eri Katsari

Fnished target parsing

45528 23/01/2017 08:13 AM Eri Katsari

Fnished source parsing

45517 20/01/2017 05:21 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45062 04/01/2017 10:51 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45061 04/01/2017 10:30 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45060 04/01/2017 07:29 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45056 04/01/2017 12:53 AM Eri Katsari

Cleaned up Limes Reducer

45055 03/01/2017 11:56 PM Eri Katsari

Restored limes reducer

45054 03/01/2017 11:34 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45052 02/01/2017 11:15 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45051 02/01/2017 08:34 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

45050 02/01/2017 07:58 PM Eri Katsari

Cleaned up mappers /reducers; Added a step for counting word frequencies in titles.

44370 29/10/2016 10:36 PM Eri Katsari

renamed reducers

44369 29/10/2016 10:33 PM Eri Katsari

added new classes

44368 29/10/2016 10:32 PM Eri Katsari

updates with new output format, optimized code, custom comparator

44326 25/10/2016 11:42 PM Eri Katsari

updates with new output format

44308 25/10/2016 01:25 AM Eri Katsari

clea

44307 25/10/2016 01:16 AM Eri Katsari

clea

44296 24/10/2016 12:23 AM Eri Katsari

fixed stats and paths

44295 23/10/2016 08:59 PM Eri Katsari

cleaned up blocking

44287 21/10/2016 10:52 AM Eri Katsari

'moved

44281 21/10/2016 02:04 AM Eri Katsari

''

44280 21/10/2016 12:58 AM Eri Katsari

Trying out custom output format

44177 18/10/2016 05:01 PM Eri Katsari

Fixed error in Preprocessing

44174 18/10/2016 04:30 PM Eri Katsari

Fixed error in Preprocessing

44137 17/10/2016 09:36 PM Eri Katsari

Fixed error in Preprocessing

44054 13/10/2016 12:31 AM Eri Katsari

added pruning of fields and entities according to mappings

44053 12/10/2016 11:20 PM Eri Katsari

added pruning of fields and entities according to mappings

43952 06/10/2016 12:02 AM Eri Katsari

''

43951 05/10/2016 10:14 PM Eri Katsari

added optimized cache implementation

43950 05/10/2016 10:00 PM Eri Katsari

added optimized cache implementation

43949 05/10/2016 08:03 PM Eri Katsari
43836 27/09/2016 10:18 PM Eri Katsari
43537 31/08/2016 11:55 PM Eri Katsari

fix for parsing error in preprocessing

43536 31/08/2016 11:00 PM Eri Katsari

fixed issue with counters

43526 30/08/2016 08:10 PM Eri Katsari

added test suite for MR

43506 25/08/2016 01:18 AM Eri Katsari

added test suite for MR