Project

General

Profile

Statistics
| Revision:

# Date Author Comment
38769 26/08/2015 11:55 AM Marek Horst

merging trunk changes with 20150518_new_funding_model branch

37296 18/05/2015 05:38 PM Marek Horst

#1304 updating both project importer modules, reading from rdb and hbase, by supporting new XML funding tree representation.

37295 18/05/2015 05:31 PM Marek Horst

#1302 introducing 20150518_new_funding_model branch

37093 11/05/2015 11:40 AM Marek Horst

#1302 introducing support for updated project model containing funding tree defined as XML instead of JSON, not enabled yet.

35701 27/03/2015 06:18 AM Mateusz Kobos

Removing usage of working_dir from Java workflow node.

35227 11/03/2015 01:14 PM Marek Horst

#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072

34330 06/02/2015 03:18 PM Marek Horst

#1109 fixing building excluded acronym values

34325 06/02/2015 01:56 PM Marek Horst

#1109 utilizing isAcronymValid() method in relational db importer. skipping project grant id whenever code is empty

34324 06/02/2015 01:52 PM Marek Horst

#1109 making isAcronymValid() method public so it could be utilized by relational db importer as well

34323 06/02/2015 01:47 PM Marek Horst

#1109 making isAcronymValid() method public so it could be utilized by relational db importer as well

34322 06/02/2015 01:40 PM Marek Horst

#1109 introducing support for multiple acronym values to be skipped, currently set to 'unknown' and 'undefined' values.

34211 02/02/2015 06:21 PM Marek Horst

#1070 introducing support for multiple context identifiers, replacing import_project_concepts_context_id IIS input parameter with import_project_concepts_context_ids_csv

33124 01/12/2014 07:48 PM Marek Horst

#968 aligning IIS importer with ObjectStore#deliverObjects() API method changes

33120 01/12/2014 03:41 PM Marek Horst

#919 supporting multiple profiles in concept importer, logging error instead of throwing exception when profile not found

33073 28/11/2014 11:14 AM Marek Horst

removing obsolete package

33071 28/11/2014 11:14 AM Marek Horst

#919 introducing Concept schema and importer module producing avro datastore based on XML profile

32826 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datasetore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32088 03/11/2014 04:23 PM Marek Horst

#118 introducing piwik logs importer module

31842 28/10/2014 03:31 PM Marek Horst

#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT

31836 28/10/2014 02:25 PM Marek Horst

#913 temporarily setting contentSize to -1 in ObjectStore DocumentContentURL importer module until ObjectStore exposes proper size value

31441 20/10/2014 12:22 PM Marek Horst

removing redundant logging

31440 20/10/2014 12:19 PM Marek Horst

logging added: imported -> processed, not all of them were imported

31439 20/10/2014 12:18 PM Marek Horst

logging added: presenting total number of imported records

31438 20/10/2014 11:55 AM Marek Horst

logging added when id type of id value is null and record is not written

31421 17/10/2014 12:40 PM Marek Horst

#883 introducing support for blacklisting object store identifiers

31280 13/10/2014 11:22 AM Marek Horst

bugfixing citations converter by prefixing identifier with 50| prefix which was removed when exporting destination document id in BLOB exporter

31266 10/10/2014 03:32 PM Marek Horst

introducing support for handling update column qualifiers holding inferenced data, disabled by default

31265 10/10/2014 03:29 PM Marek Horst

fixing NullPointerException in citations exporter

31248 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31224 08/10/2014 06:19 PM Marek Horst

#840 moving IdentifierMapping from importer to common package

31219 08/10/2014 06:12 PM Marek Horst

#840 renaming DeduplicationMapping to more generic IdentifierMapping

31165 06/10/2014 06:05 PM Marek Horst

imports cleanup

31164 06/10/2014 05:59 PM Marek Horst

#637 treemap->hashmap, order is not preserved anyway

31163 06/10/2014 05:56 PM Marek Horst

#637 introducing ISLookup based vocabulary importer

30982 01/10/2014 06:23 PM Marek Horst

#433 introducing natural citations ordering

30907 26/09/2014 07:30 PM Marek Horst

#799 updating header name from header to oai:header. Introducing additional check verifying empty id.

29853 25/08/2014 06:06 PM Marek Horst

moving ACM importer to icm-iis-mainworkflows due to extending dependances with cermine, introducing performance tests

29833 22/08/2014 05:27 PM Marek Horst

checking whether trust level is empty before comparing to predefined threshold

29825 22/08/2014 02:24 PM Marek Horst

introducing trust level threshold support when importing information space data

29807 21/08/2014 02:03 PM Marek Horst

introducing shared citation ExtraData XML model in icm-iis-common, implementing citation importer in mapred_import workflow, implementing exporter module

29801 20/08/2014 05:58 PM Marek Horst

supporting $UNDEFINED$ value in IMPORT_INFERENCE_PROVENANCE_BLACKLIST

29515 24/07/2014 05:06 PM Marek Horst

#527 introducing ACM XML dump importer module importing bibliographic references for further citation-matching analysis

29103 14/07/2014 05:01 PM Marek Horst

extending progress log interval from 10 000 to 100 000

28940 08/07/2014 01:41 PM Marek Horst

fixing importing abstract after introducing fieldApprover for all Result fields

28937 08/07/2014 11:55 AM Marek Horst

introducing fieldApprover for all Result fields