Project

General

Profile

Statistics
| Revision:

# Date Author Comment
39162 10/09/2015 05:37 PM Marek Horst

merging trunk changes with IIS-CDH-5.3.0 branch

39042 04/09/2015 11:26 PM Marek Horst

#1498 introducing major citations related refactoring including new generic direct citation matching moved to processing phase, introduced position field in all citations schemas and updated collapser taking position into account when merging citations details coming from 3 variuos sources: fuzzy citationmatching, direct citationmatching, references metadata

39025 04/09/2015 03:42 PM Mateusz Kobos

Removing old code related to Protocol Buffers

Long time ago, we used Protocol Buffers encapsulated in sequence files as the format for data stores. This cleanup is removing code related to this functionality.

38999 04/09/2015 11:06 AM Marek Horst

#1302 merging 20150518_new_funding_model branch back to the trunk

38769 26/08/2015 11:55 AM Marek Horst

merging trunk changes with 20150518_new_funding_model branch

37979 26/06/2015 07:46 PM Marek Horst

#1395 WorkflowRuntimeParameters static fields cleanup, moving parameters to dedicated modules to prevent excessing icm-iis-common module modifications

37898 19/06/2015 06:54 PM Marek Horst

merging 20150518_new_funding_model branch changes with IIS-CDH-5.3.0 branch in order to support new funding model in IIS-CDH-5.3.0 branch

37895 19/06/2015 06:28 PM Marek Horst

merging trunk changes with 20150518_new_funding_model branch

37879 19/06/2015 03:53 PM Marek Horst

merging trunk changes with IIS-CDH-5.3.0 branch

37753 12/06/2015 04:28 PM Marek Horst

concatenating identifiers with stringutils

37534 28/05/2015 04:30 PM Marek Horst

introducing objectstores provider for manual object store identifiers retrieval

37301 18/05/2015 06:06 PM Marek Horst

#1304 updating project imported unit test

37296 18/05/2015 05:38 PM Marek Horst

#1304 updating both project importer modules, reading from rdb and hbase, by supporting new XML funding tree representation.

37295 18/05/2015 05:31 PM Marek Horst

#1302 introducing 20150518_new_funding_model branch

37232 14/05/2015 12:07 PM Marek Horst

#1302 concatenating funder with top level funding when building Project#fundingClass

37093 11/05/2015 11:40 AM Marek Horst

#1302 introducing support for updated project model containing funding tree defined as XML instead of JSON, not enabled yet.

36943 05/05/2015 07:58 PM Marek Horst

changing hbase-server dependency scope from provided to compile, apparently hbase-server is not available on CDH5 cluster

36942 05/05/2015 06:42 PM Marek Horst

merging trunk changes with IIS-CDH-5.3.0 branch

36285 09/04/2015 07:10 PM Marek Horst

#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters

35706 27/03/2015 09:40 AM Marek Horst

#1135 switching icm-iis-parent-container version to 1.0.1-SNAPSHOT in order to include workingDir related changes made in icm-iis-core

35701 27/03/2015 06:18 AM Mateusz Kobos

Removing usage of working_dir from Java workflow node.

35513 19/03/2015 05:09 PM Marek Horst

#1208 upgrading dnet-openaireplus-mapping-utils dependency range to [3.0.0,4.0.0)

35413 17/03/2015 03:04 PM Marek Horst

#1198 aligning IIS dependencies and java code to CDH5.3.0 cluster

35399 17/03/2015 03:01 PM Marek Horst

#1197 introducing job.properties changes aligning paths to rumcajs cluster HDFS structure

35242 11/03/2015 04:41 PM Marek Horst

creating IIS-CDH-5.3.0 branch

35241 11/03/2015 04:41 PM Marek Horst

introducing branches folder

35227 11/03/2015 01:14 PM Marek Horst

#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072

34692 20/02/2015 07:16 PM Marek Horst

#1133 dropping useless workfing_dir creation for java nodes

34627 19/02/2015 06:12 PM Marek Horst

#1038 introducing ranges in dependencies definition for all IIS modules

34582 18/02/2015 06:50 PM Marek Horst

#1038 reintroducing ranges in dependencies definition for all non-iis dnet modules

34564 18/02/2015 01:54 PM Marek Horst

updating job.properties

34330 06/02/2015 03:18 PM Marek Horst

#1109 fixing building excluded acronym values

34326 06/02/2015 01:59 PM Marek Horst

updating default job properties

34325 06/02/2015 01:56 PM Marek Horst

#1109 utilizing isAcronymValid() method in relational db importer. skipping project grant id whenever code is empty

34324 06/02/2015 01:52 PM Marek Horst

#1109 making isAcronymValid() method public so it could be utilized by relational db importer as well

34323 06/02/2015 01:47 PM Marek Horst

#1109 making isAcronymValid() method public so it could be utilized by relational db importer as well

34322 06/02/2015 01:40 PM Marek Horst

#1109 introducing support for multiple acronym values to be skipped, currently set to 'unknown' and 'undefined' values.

34266 04/02/2015 12:05 PM Marek Horst

updating default job properties

34214 02/02/2015 06:24 PM Marek Horst

#1070 updating import_project_concepts_context_ids_csv default value to "fet-fp7,fet-h2020"

34211 02/02/2015 06:21 PM Marek Horst

#1070 introducing support for multiple context identifiers, replacing import_project_concepts_context_id IIS input parameter with import_project_concepts_context_ids_csv

33961 16/01/2015 04:27 PM Marek Horst

#1065 updating job properties

33574 16/12/2014 04:13 PM Marek Horst

[maven-release-plugin] prepare for next development iteration

33573 16/12/2014 04:13 PM Marek Horst

[maven-release-plugin] copy for tag icm-iis-import-1.0.0

33572 16/12/2014 04:13 PM Marek Horst

[maven-release-plugin] prepare release icm-iis-import-1.0.0

33571 16/12/2014 04:10 PM Marek Horst

#1044 pre-release switching to released version of parent pom and released dependencies

33412 15/12/2014 12:43 PM Marek Horst

introducing scm definition

33399 15/12/2014 12:27 PM Marek Horst

#1038 upgrading dnet dependencies to latest released versions listed by Claudio in #1038#note-3

33164 03/12/2014 04:40 PM Marek Horst

#919 removing context_id default value from workflow.xml definition

33124 01/12/2014 07:48 PM Marek Horst

#968 aligning IIS importer with ObjectStore#deliverObjects() API method changes

33120 01/12/2014 03:41 PM Marek Horst

#919 supporting multiple profiles in concept importer, logging error instead of throwing exception when profile not found

33073 28/11/2014 11:14 AM Marek Horst

removing obsolete package

33071 28/11/2014 11:14 AM Marek Horst

#919 introducing Concept schema and importer module producing avro datastore based on XML profile

32826 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datasetore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32379 10/11/2014 12:58 PM Marek Horst

updating mdstore test

32241 05/11/2014 05:31 PM Marek Horst

introducing embedded integration test entry

32089 03/11/2014 04:24 PM Marek Horst

#118 introducing piwik logs importer module

32088 03/11/2014 04:23 PM Marek Horst

#118 introducing piwik logs importer module

31847 28/10/2014 03:53 PM Marek Horst

#913 updating dnet-objectstore-rmi dependency from 2.0.1-SNAPSHOT to 2.0.0

31842 28/10/2014 03:31 PM Marek Horst

#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT

31836 28/10/2014 02:25 PM Marek Horst

#913 temporarily setting contentSize to -1 in ObjectStore DocumentContentURL importer module until ObjectStore exposes proper size value

31441 20/10/2014 12:22 PM Marek Horst

removing redundant logging

31440 20/10/2014 12:19 PM Marek Horst

logging added: imported -> processed, not all of them were imported

31439 20/10/2014 12:18 PM Marek Horst

logging added: presenting total number of imported records

31438 20/10/2014 11:55 AM Marek Horst

logging added when id type of id value is null and record is not written

31421 17/10/2014 12:40 PM Marek Horst

#883 introducing support for blacklisting object store identifiers

31280 13/10/2014 11:22 AM Marek Horst

bugfixing citations converter by prefixing identifier with 50| prefix which was removed when exporting destination document id in BLOB exporter

31266 10/10/2014 03:32 PM Marek Horst

introducing support for handling update column qualifiers holding inferenced data, disabled by default

31265 10/10/2014 03:29 PM Marek Horst

fixing NullPointerException in citations exporter

31251 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31248 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31224 08/10/2014 06:19 PM Marek Horst

#840 moving IdentifierMapping from importer to common package

31219 08/10/2014 06:12 PM Marek Horst

#840 renaming DeduplicationMapping to more generic IdentifierMapping

31165 06/10/2014 06:05 PM Marek Horst

imports cleanup

31164 06/10/2014 05:59 PM Marek Horst

#637 treemap->hashmap, order is not preserved anyway

31163 06/10/2014 05:56 PM Marek Horst

#637 introducing ISLookup based vocabulary importer

31042 02/10/2014 02:29 PM Marek Horst

introducing cloudera repository in parent container, removing repository definitions from individual IIS modules

31026 02/10/2014 01:28 PM Marek Horst

removing obsolete comment

30982 01/10/2014 06:23 PM Marek Horst

#433 introducing natural citations ordering

30980 01/10/2014 06:20 PM Marek Horst

updating test

30924 29/09/2014 02:29 PM Marek Horst

updating default job.properties

30908 26/09/2014 07:41 PM Marek Horst

#799 aligning dataset importer test with recent changes

30907 26/09/2014 07:30 PM Marek Horst

#799 updating header name from header to oai:header. Introducing additional check verifying empty id.

30894 26/09/2014 01:48 PM Marek Horst

introducing datadump provider for obtaining contents

30772 18/09/2014 04:30 PM Marek Horst

#780 fixing dependecy issues after recent CNR modules release by sticking to released versions of CNR modules in icm-iis-parent-container, icm-iis-import and icm-iis-export-actionmanager modules

30416 17/09/2014 11:06 AM Sandro La Bruzzo

created tag folder for release

29853 25/08/2014 06:06 PM Marek Horst

moving ACM importer to icm-iis-mainworkflows due to extending dependances with cermine, introducing performance tests

29836 25/08/2014 12:55 PM Marek Horst

updating default job properties

29834 22/08/2014 05:34 PM Marek Horst

removing 'import' directory creation and removal which was obsolete

29833 22/08/2014 05:27 PM Marek Horst

checking whether trust level is empty before comparing to predefined threshold

29825 22/08/2014 02:24 PM Marek Horst

introducing trust level threshold support when importing information space data

29818 22/08/2014 11:33 AM Marek Horst

extending description

29807 21/08/2014 02:03 PM Marek Horst

introducing shared citation ExtraData XML model in icm-iis-common, implementing citation importer in mapred_import workflow, implementing exporter module

29801 20/08/2014 05:58 PM Marek Horst

supporting $UNDEFINED$ value in IMPORT_INFERENCE_PROVENANCE_BLACKLIST

29520 24/07/2014 06:55 PM Marek Horst

fixing dirs creation: removing obsolete directories

29515 24/07/2014 05:06 PM Marek Horst

#527 introducing ACM XML dump importer module importing bibliographic references for further citation-matching analysis

29103 14/07/2014 05:01 PM Marek Horst

extending progress log interval from 10 000 to 100 000

28940 08/07/2014 01:41 PM Marek Horst

fixing importing abstract after introducing fieldApprover for all Result fields

28937 08/07/2014 11:55 AM Marek Horst

introducing fieldApprover for all Result fields