Project

General

Profile

Statistics
| Revision:

# Date Author Comment
39042 04/09/2015 11:26 PM Marek Horst

#1498 introducing major citations related refactoring including new generic direct citation matching moved to processing phase, introduced position field in all citations schemas and updated collapser taking position into account when merging citations details coming from 3 variuos sources: fuzzy citationmatching, direct citationmatching, references metadata

38999 04/09/2015 11:06 AM Marek Horst

#1302 merging 20150518_new_funding_model branch back to the trunk

37979 26/06/2015 07:46 PM Marek Horst

#1395 WorkflowRuntimeParameters static fields cleanup, moving parameters to dedicated modules to prevent excessing icm-iis-common module modifications

37232 14/05/2015 12:07 PM Marek Horst

#1302 concatenating funder with top level funding when building Project#fundingClass

37093 11/05/2015 11:40 AM Marek Horst

#1302 introducing support for updated project model containing funding tree defined as XML instead of JSON, not enabled yet.

36285 09/04/2015 07:10 PM Marek Horst

#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters

35701 27/03/2015 06:18 AM Mateusz Kobos

Removing usage of working_dir from Java workflow node.

35227 11/03/2015 01:14 PM Marek Horst

#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072

34692 20/02/2015 07:16 PM Marek Horst

#1133 dropping useless workfing_dir creation for java nodes

34564 18/02/2015 01:54 PM Marek Horst

updating job.properties

34330 06/02/2015 03:18 PM Marek Horst

#1109 fixing building excluded acronym values

34326 06/02/2015 01:59 PM Marek Horst

updating default job properties

34325 06/02/2015 01:56 PM Marek Horst

#1109 utilizing isAcronymValid() method in relational db importer. skipping project grant id whenever code is empty

34324 06/02/2015 01:52 PM Marek Horst

#1109 making isAcronymValid() method public so it could be utilized by relational db importer as well

34323 06/02/2015 01:47 PM Marek Horst

#1109 making isAcronymValid() method public so it could be utilized by relational db importer as well

34322 06/02/2015 01:40 PM Marek Horst

#1109 introducing support for multiple acronym values to be skipped, currently set to 'unknown' and 'undefined' values.

34266 04/02/2015 12:05 PM Marek Horst

updating default job properties

34214 02/02/2015 06:24 PM Marek Horst

#1070 updating import_project_concepts_context_ids_csv default value to "fet-fp7,fet-h2020"

34211 02/02/2015 06:21 PM Marek Horst

#1070 introducing support for multiple context identifiers, replacing import_project_concepts_context_id IIS input parameter with import_project_concepts_context_ids_csv

33961 16/01/2015 04:27 PM Marek Horst

#1065 updating job properties

33164 03/12/2014 04:40 PM Marek Horst

#919 removing context_id default value from workflow.xml definition

33124 01/12/2014 07:48 PM Marek Horst

#968 aligning IIS importer with ObjectStore#deliverObjects() API method changes

33120 01/12/2014 03:41 PM Marek Horst

#919 supporting multiple profiles in concept importer, logging error instead of throwing exception when profile not found

33073 28/11/2014 11:14 AM Marek Horst

removing obsolete package

33071 28/11/2014 11:14 AM Marek Horst

#919 introducing Concept schema and importer module producing avro datastore based on XML profile

32826 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datasetore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32089 03/11/2014 04:24 PM Marek Horst

#118 introducing piwik logs importer module

32088 03/11/2014 04:23 PM Marek Horst

#118 introducing piwik logs importer module

31842 28/10/2014 03:31 PM Marek Horst

#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT

31836 28/10/2014 02:25 PM Marek Horst

#913 temporarily setting contentSize to -1 in ObjectStore DocumentContentURL importer module until ObjectStore exposes proper size value

31441 20/10/2014 12:22 PM Marek Horst

removing redundant logging

31440 20/10/2014 12:19 PM Marek Horst

logging added: imported -> processed, not all of them were imported

31439 20/10/2014 12:18 PM Marek Horst

logging added: presenting total number of imported records

31438 20/10/2014 11:55 AM Marek Horst

logging added when id type of id value is null and record is not written

31421 17/10/2014 12:40 PM Marek Horst

#883 introducing support for blacklisting object store identifiers

31280 13/10/2014 11:22 AM Marek Horst

bugfixing citations converter by prefixing identifier with 50| prefix which was removed when exporting destination document id in BLOB exporter

31266 10/10/2014 03:32 PM Marek Horst

introducing support for handling update column qualifiers holding inferenced data, disabled by default

31265 10/10/2014 03:29 PM Marek Horst

fixing NullPointerException in citations exporter

31248 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31224 08/10/2014 06:19 PM Marek Horst

#840 moving IdentifierMapping from importer to common package

31219 08/10/2014 06:12 PM Marek Horst

#840 renaming DeduplicationMapping to more generic IdentifierMapping

31165 06/10/2014 06:05 PM Marek Horst

imports cleanup

31164 06/10/2014 05:59 PM Marek Horst

#637 treemap->hashmap, order is not preserved anyway

31163 06/10/2014 05:56 PM Marek Horst

#637 introducing ISLookup based vocabulary importer

30982 01/10/2014 06:23 PM Marek Horst

#433 introducing natural citations ordering

30924 29/09/2014 02:29 PM Marek Horst

updating default job.properties

30907 26/09/2014 07:30 PM Marek Horst

#799 updating header name from header to oai:header. Introducing additional check verifying empty id.

29853 25/08/2014 06:06 PM Marek Horst

moving ACM importer to icm-iis-mainworkflows due to extending dependances with cermine, introducing performance tests

29836 25/08/2014 12:55 PM Marek Horst

updating default job properties

29834 22/08/2014 05:34 PM Marek Horst

removing 'import' directory creation and removal which was obsolete

29833 22/08/2014 05:27 PM Marek Horst

checking whether trust level is empty before comparing to predefined threshold

29825 22/08/2014 02:24 PM Marek Horst

introducing trust level threshold support when importing information space data

29818 22/08/2014 11:33 AM Marek Horst

extending description

29807 21/08/2014 02:03 PM Marek Horst

introducing shared citation ExtraData XML model in icm-iis-common, implementing citation importer in mapred_import workflow, implementing exporter module

29801 20/08/2014 05:58 PM Marek Horst

supporting $UNDEFINED$ value in IMPORT_INFERENCE_PROVENANCE_BLACKLIST

29520 24/07/2014 06:55 PM Marek Horst

fixing dirs creation: removing obsolete directories

29515 24/07/2014 05:06 PM Marek Horst

#527 introducing ACM XML dump importer module importing bibliographic references for further citation-matching analysis

29103 14/07/2014 05:01 PM Marek Horst

extending progress log interval from 10 000 to 100 000

28940 08/07/2014 01:41 PM Marek Horst

fixing importing abstract after introducing fieldApprover for all Result fields

28937 08/07/2014 11:55 AM Marek Horst

introducing fieldApprover for all Result fields

28863 03/07/2014 12:55 PM Marek Horst

updating default job.properties

28860 03/07/2014 12:26 PM Marek Horst

updating default job.properties