Project

General

Profile

Statistics
| Revision:

# Date Author Comment
32826 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datasetore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32379 10/11/2014 12:58 PM Marek Horst

updating mdstore test

32241 05/11/2014 05:31 PM Marek Horst

introducing embedded integration test entry

32089 03/11/2014 04:24 PM Marek Horst

#118 introducing piwik logs importer module

32088 03/11/2014 04:23 PM Marek Horst

#118 introducing piwik logs importer module

31847 28/10/2014 03:53 PM Marek Horst

#913 updating dnet-objectstore-rmi dependency from 2.0.1-SNAPSHOT to 2.0.0

31842 28/10/2014 03:31 PM Marek Horst

#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT

31836 28/10/2014 02:25 PM Marek Horst

#913 temporarily setting contentSize to -1 in ObjectStore DocumentContentURL importer module until ObjectStore exposes proper size value

31441 20/10/2014 12:22 PM Marek Horst

removing redundant logging

31440 20/10/2014 12:19 PM Marek Horst

logging added: imported -> processed, not all of them were imported

31439 20/10/2014 12:18 PM Marek Horst

logging added: presenting total number of imported records

31438 20/10/2014 11:55 AM Marek Horst

logging added when id type of id value is null and record is not written

31421 17/10/2014 12:40 PM Marek Horst

#883 introducing support for blacklisting object store identifiers

31280 13/10/2014 11:22 AM Marek Horst

bugfixing citations converter by prefixing identifier with 50| prefix which was removed when exporting destination document id in BLOB exporter

31266 10/10/2014 03:32 PM Marek Horst

introducing support for handling update column qualifiers holding inferenced data, disabled by default

31265 10/10/2014 03:29 PM Marek Horst

fixing NullPointerException in citations exporter

31251 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31248 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31224 08/10/2014 06:19 PM Marek Horst

#840 moving IdentifierMapping from importer to common package

31219 08/10/2014 06:12 PM Marek Horst

#840 renaming DeduplicationMapping to more generic IdentifierMapping

31165 06/10/2014 06:05 PM Marek Horst

imports cleanup

31164 06/10/2014 05:59 PM Marek Horst

#637 treemap->hashmap, order is not preserved anyway

31163 06/10/2014 05:56 PM Marek Horst

#637 introducing ISLookup based vocabulary importer

31042 02/10/2014 02:29 PM Marek Horst

introducing cloudera repository in parent container, removing repository definitions from individual IIS modules

31026 02/10/2014 01:28 PM Marek Horst

removing obsolete comment

30982 01/10/2014 06:23 PM Marek Horst

#433 introducing natural citations ordering

30980 01/10/2014 06:20 PM Marek Horst

updating test

30924 29/09/2014 02:29 PM Marek Horst

updating default job.properties

30908 26/09/2014 07:41 PM Marek Horst

#799 aligning dataset importer test with recent changes

30907 26/09/2014 07:30 PM Marek Horst

#799 updating header name from header to oai:header. Introducing additional check verifying empty id.

30894 26/09/2014 01:48 PM Marek Horst

introducing datadump provider for obtaining contents

30772 18/09/2014 04:30 PM Marek Horst

#780 fixing dependecy issues after recent CNR modules release by sticking to released versions of CNR modules in icm-iis-parent-container, icm-iis-import and icm-iis-export-actionmanager modules

30416 17/09/2014 11:06 AM Sandro La Bruzzo

created tag folder for release

29853 25/08/2014 06:06 PM Marek Horst

moving ACM importer to icm-iis-mainworkflows due to extending dependances with cermine, introducing performance tests

29836 25/08/2014 12:55 PM Marek Horst

updating default job properties

29834 22/08/2014 05:34 PM Marek Horst

removing 'import' directory creation and removal which was obsolete

29833 22/08/2014 05:27 PM Marek Horst

checking whether trust level is empty before comparing to predefined threshold

29825 22/08/2014 02:24 PM Marek Horst

introducing trust level threshold support when importing information space data

29818 22/08/2014 11:33 AM Marek Horst

extending description

29807 21/08/2014 02:03 PM Marek Horst

introducing shared citation ExtraData XML model in icm-iis-common, implementing citation importer in mapred_import workflow, implementing exporter module

29801 20/08/2014 05:58 PM Marek Horst

supporting $UNDEFINED$ value in IMPORT_INFERENCE_PROVENANCE_BLACKLIST

29520 24/07/2014 06:55 PM Marek Horst

fixing dirs creation: removing obsolete directories

29515 24/07/2014 05:06 PM Marek Horst

#527 introducing ACM XML dump importer module importing bibliographic references for further citation-matching analysis

29103 14/07/2014 05:01 PM Marek Horst

extending progress log interval from 10 000 to 100 000

28940 08/07/2014 01:41 PM Marek Horst

fixing importing abstract after introducing fieldApprover for all Result fields

28937 08/07/2014 11:55 AM Marek Horst

introducing fieldApprover for all Result fields

28863 03/07/2014 12:55 PM Marek Horst

updating default job.properties

28860 03/07/2014 12:26 PM Marek Horst

updating default job.properties

28767 01/07/2014 05:04 PM Marek Horst

introducing deploy.info file for module icm-iis-import