# Date Author Comment
39047 04/09/2015 11:26 PM Marek Horst

#1498 introducing major citations-related refactoring, including new generic direct citation matching moved to processing phase; introduced position field in all citation schemas and updated collapser to take position into account when merging citation details coming from 3 various sources: fuzzy citationmatching, direct citationmatching, references metadata

37981 26/06/2015 07:46 PM Marek Horst

#1395 WorkflowRuntimeParameters static fields cleanup, moving parameters to dedicated modules to prevent excessive icm-iis-common module modifications

37977 26/06/2015 05:49 PM Marek Horst

#1209 introducing support for trust level thresholds provided as IIS input parameter

37757 12/06/2015 04:35 PM Marek Horst

#1212 updating taxonomies database, introducing acm taxonomy classification, introducing acm classes support in exporter module, updating integration tests

37346 20/05/2015 06:49 PM Marek Horst

#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.

36292 09/04/2015 07:10 PM Marek Horst

#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters

35827 30/03/2015 06:46 PM Marek Horst

#1237 introducing dedicated oozie and job properties dumpers writing all job properties into ${workingDir}/job.properties file

35823 30/03/2015 04:45 PM Marek Horst

#1237 introducing properties dumper process writing all job properties into ${workingDir}/job.properties file

35701 27/03/2015 06:18 AM Mateusz Kobos

Removing usage of working_dir from Java workflow node.

31695 24/10/2014 02:55 PM Marek Horst

renaming import.content.max.file.size workflow input parameter to import.content.max.file.size.mb

31694 24/10/2014 02:52 PM Marek Horst

introducing import.content.max.file.size workflow input parameter

31419 17/10/2014 12:24 PM Marek Horst

#883 introducing import.content.blacklisted.objectstores.csv property

31261 10/10/2014 12:11 PM Marek Horst

introducing IMPORT_MERGE_BODY_WITH_UPDATES flag. When set, Oaf retrieved from body should be merged with all available updates holding inferred content

31249 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values
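
The regex-based approval described in the entry above could look roughly like the sketch below. The class name, method name and sample provenance strings are hypothetical and are not taken from the IIS code base; only the iis::* style of provenance comes from the commit comment.

```java
import java.util.regex.Pattern;

// Hypothetical sketch: approves a result when its provenance matches a configured regex.
final class ProvenanceApproverSketch {

    private final Pattern provenancePattern;

    // The regex is assumed to arrive as a workflow parameter, e.g. "iis::.*".
    ProvenanceApproverSketch(String provenanceRegex) {
        this.provenancePattern = Pattern.compile(provenanceRegex);
    }

    // Returns true only when the whole provenance string matches the pattern.
    boolean approve(String provenance) {
        return provenance != null && provenancePattern.matcher(provenance).matches();
    }
}
```

With the pattern "iis::.*", a provenance such as "iis::citations" would be approved, while a value without the iis:: prefix would be rejected.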

31162 06/10/2014 05:44 PM Marek Horst

#637 introducing vocabulary related import parameters

31016 02/10/2014 11:56 AM Marek Horst

#433 fixing problem with unserializable comparator: moving logic to comparable objects

30984 01/10/2014 06:23 PM Marek Horst

#433 introducing natural citations ordering

29824 22/08/2014 02:15 PM Marek Horst

introducing IMPORT_TRUST_LEVEL_THRESHOLD parameter

29809 21/08/2014 02:03 PM Marek Horst

introducing shared citation ExtraData XML model in icm-iis-common, implementing citation importer in mapred_import workflow, implementing exporter module

29806 21/08/2014 12:56 PM Marek Horst

introducing CONFIDENCE_TO_TRUST_LEVEL_FACTOR field holding 0.9 multiplier used for IIS confidence level [0;1] translation to trust level [0;0.9]
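
The translation described in the entry above amounts to a single multiplication. A minimal sketch follows; the class name, the method name and the use of double are assumptions, while the constant name and the 0.9 value come from the commit comment.

```java
// Hypothetical sketch of the confidence level to trust level translation.
final class TrustLevelSketch {

    // Multiplier named in the commit comment: maps confidence [0;1] onto trust level [0;0.9].
    static final double CONFIDENCE_TO_TRUST_LEVEL_FACTOR = 0.9;

    // Scales an IIS confidence level into a trust level; rejects values outside [0;1] (including NaN).
    static double toTrustLevel(double confidence) {
        if (!(confidence >= 0.0 && confidence <= 1.0)) {
            throw new IllegalArgumentException("confidence must be within [0;1]: " + confidence);
        }
        return confidence * CONFIDENCE_TO_TRUST_LEVEL_FACTOR;
    }
}
```

Under this sketch a confidence of 1.0 yields the maximum trust level of 0.9, and a confidence of 0 yields 0.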

29070 14/07/2014 10:18 AM Marek Horst

introducing FsShellPermissions utility class, utilizing properly working chmod in metadataextraction cache

28956 08/07/2014 05:18 PM Marek Horst

updating programmatic execution of chmod on meta.json file, still not working due to "Permission denied" warnings

28855 02/07/2014 08:00 PM Marek Horst

introducing change of meta.json file permissions, allowing writing for different users