Project

General

Profile

Statistics
| Revision:

# Date Author Comment
34266 04/02/2015 12:05 PM Marek Horst

updating default job properties

34214 02/02/2015 06:24 PM Marek Horst

#1070 updating import_project_concepts_context_ids_csv default value to "fet-fp7,fet-h2020"

34211 02/02/2015 06:21 PM Marek Horst

#1070 introducing support for multiple context identifiers, replacing import_project_concepts_context_id IIS input parameter with import_project_concepts_context_ids_csv

33961 16/01/2015 04:27 PM Marek Horst

#1065 updating job properties

33164 03/12/2014 04:40 PM Marek Horst

#919 removing context_id default value from workflow.xml definition

33124 01/12/2014 07:48 PM Marek Horst

#968 aligning IIS importer with ObjectStore#deliverObjects() API method changes

33120 01/12/2014 03:41 PM Marek Horst

#919 supporting multiple profiles in concept importer, logging error instead of throwing exception when profile not found

33073 28/11/2014 11:14 AM Marek Horst

removing obsolete package

33071 28/11/2014 11:14 AM Marek Horst

#919 introducing Concept schema and importer module producing avro datastore based on XML profile

32826 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datasetore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32089 03/11/2014 04:24 PM Marek Horst

#118 introducing piwik logs importer module

32088 03/11/2014 04:23 PM Marek Horst

#118 introducing piwik logs importer module

31842 28/10/2014 03:31 PM Marek Horst

#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT

31836 28/10/2014 02:25 PM Marek Horst

#913 temporarily setting contentSize to -1 in ObjectStore DocumentContentURL importer module until ObjectStore exposes proper size value

31441 20/10/2014 12:22 PM Marek Horst

removing redundant logging

31440 20/10/2014 12:19 PM Marek Horst

logging added: imported -> processed, not all of them were imported

31439 20/10/2014 12:18 PM Marek Horst

logging added: presenting total number of imported records

31438 20/10/2014 11:55 AM Marek Horst

logging added when id type of id value is null and record is not written

31421 17/10/2014 12:40 PM Marek Horst

#883 introducing support for blacklisting object store identifiers

31280 13/10/2014 11:22 AM Marek Horst

bugfixing citations converter by prefixing identifier with 50| prefix which was removed when exporting destination document id in BLOB exporter

31266 10/10/2014 03:32 PM Marek Horst

introducing support for handling update column qualifiers holding inferenced data, disabled by default

31265 10/10/2014 03:29 PM Marek Horst

fixing NullPointerException in citations exporter

31248 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31224 08/10/2014 06:19 PM Marek Horst

#840 moving IdentifierMapping from importer to common package

31219 08/10/2014 06:12 PM Marek Horst

#840 renaming DeduplicationMapping to more generic IdentifierMapping

31165 06/10/2014 06:05 PM Marek Horst

imports cleanup

31164 06/10/2014 05:59 PM Marek Horst

#637 treemap->hashmap, order is not preserved anyway

31163 06/10/2014 05:56 PM Marek Horst

#637 introducing ISLookup based vocabulary importer

30982 01/10/2014 06:23 PM Marek Horst

#433 introducing natural citations ordering

30924 29/09/2014 02:29 PM Marek Horst

updating default job.properties

30907 26/09/2014 07:30 PM Marek Horst

#799 updating header name from header to oai:header. Introducing additional check verifying empty id.

29853 25/08/2014 06:06 PM Marek Horst

moving ACM importer to icm-iis-mainworkflows due to extending dependances with cermine, introducing performance tests

29836 25/08/2014 12:55 PM Marek Horst

updating default job properties

29834 22/08/2014 05:34 PM Marek Horst

removing 'import' directory creation and removal which was obsolete

29833 22/08/2014 05:27 PM Marek Horst

checking whether trust level is empty before comparing to predefined threshold

29825 22/08/2014 02:24 PM Marek Horst

introducing trust level threshold support when importing information space data

29818 22/08/2014 11:33 AM Marek Horst

extending description

29807 21/08/2014 02:03 PM Marek Horst

introducing shared citation ExtraData XML model in icm-iis-common, implementing citation importer in mapred_import workflow, implementing exporter module

29801 20/08/2014 05:58 PM Marek Horst

supporting $UNDEFINED$ value in IMPORT_INFERENCE_PROVENANCE_BLACKLIST

29520 24/07/2014 06:55 PM Marek Horst

fixing dirs creation: removing obsolete directories

29515 24/07/2014 05:06 PM Marek Horst

#527 introducing ACM XML dump importer module importing bibliographic references for further citation-matching analysis

29103 14/07/2014 05:01 PM Marek Horst

extending progress log interval from 10 000 to 100 000

28940 08/07/2014 01:41 PM Marek Horst

fixing importing abstract after introducing fieldApprover for all Result fields

28937 08/07/2014 11:55 AM Marek Horst

introducing fieldApprover for all Result fields

28863 03/07/2014 12:55 PM Marek Horst

updating default job.properties

28860 03/07/2014 12:26 PM Marek Horst

updating default job.properties