Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  core 36333 over 9 years Marek Horst #1257 raising oozie.action.max.output.data to 8192
  src 39054 about 9 years Marek Horst #1498 adding missing position field
deploy.info 787 Bytes 32244 almost 10 years Marek Horst introducing embedded integration test entry
pom.xml 4.66 KB 37874 over 9 years Marek Horst #1381 porting pmc citations ingestion from casc...
  • svn:ignore: *.iml .* bin build target .project

Latest revisions

# Date Author Comment
39054 05/09/2015 08:49 PM Marek Horst

#1498 adding missing position field

39051 04/09/2015 11:48 PM Marek Horst

#1498 removing obsolete ingest pmc citation resources

39049 04/09/2015 11:36 PM Marek Horst

#1498 introducing major citations related refactoring including new generic direct citation matching moved to processing phase, introduced position field in all citations schemas and updated collapser taking position into account when merging citations details coming from 3 variuos sources: fuzzy citationmatching, direct citationmatching, references metadata

38870 31/08/2015 08:40 AM Lukasz Dumiszewski

Initial commit - settings and project files added to svn:ignore

38126 08/07/2015 06:17 PM Marek Horst

#1422 fixing Java Heap Space error while executing checksum postprocessing worfklow on pmc plaintexts

37874 19/06/2015 02:07 PM Marek Horst

#1381 porting pmc citations ingestion from cascading framework to pig. Moving code from icm-iis-ingest-pmc to icm-iis-transformers including itegration tests, removing obsolete scala code along with unneded dependencies. Switching subworkflow in primary workflow.

37652 08/06/2015 01:37 PM Marek Horst

expecting null affiliations instead of empty array

37651 08/06/2015 01:29 PM Marek Horst

adding missing affiliations field in input data, removing duplicates from outut

37594 29/05/2015 05:16 PM Marek Horst

adding missing affiliations field in integration test expected output

37368 21/05/2015 06:26 PM Marek Horst

#1315 propagating confidenceLevel to DocumentToConceptIds. Updating PIG transformer script by introducing concept identifiers deduplication UDF function picking record with the highest confidence level, introducing unit and integration tests. Propagating changes in document to concepts exporter module.

View revisions

Also available in: Atom