Name         Size       Revision  Age              Author       Comment
core         -          36333     over 9 years     Marek Horst  #1257 raising oozie.action.max.output.data to 8192
src          -          38126     about 9 years    Marek Horst  #1422 fixing Java Heap Space error while execut...
deploy.info  787 Bytes  32244     almost 10 years  Marek Horst  introducing embedded integration test entry
pom.xml      4.66 KB    37874     over 9 years     Marek Horst  #1381 porting pmc citations ingestion from casc...
  • svn:ignore: *.iml .* bin build target

Latest revisions

# Date Author Comment
38126 08/07/2015 06:17 PM Marek Horst

#1422 fixing Java Heap Space error while executing checksum postprocessing workflow on pmc plaintexts

37874 19/06/2015 02:07 PM Marek Horst

#1381 porting pmc citations ingestion from cascading framework to pig. Moving code from icm-iis-ingest-pmc to icm-iis-transformers including integration tests, removing obsolete scala code along with unneeded dependencies. Switching subworkflow in primary workflow.

37652 08/06/2015 01:37 PM Marek Horst

expecting null affiliations instead of empty array

37651 08/06/2015 01:29 PM Marek Horst

adding missing affiliations field in input data, removing duplicates from output

37594 29/05/2015 05:16 PM Marek Horst

adding missing affiliations field in integration test expected output

37368 21/05/2015 06:26 PM Marek Horst

#1315 propagating confidenceLevel to DocumentToConceptIds. Updating PIG transformer script by introducing concept identifiers deduplication UDF function picking record with the highest confidence level, introducing unit and integration tests. Propagating changes in document to concepts exporter module.

37360 21/05/2015 02:46 PM Marek Horst

removing obsolete test resources

37347 20/05/2015 06:49 PM Marek Horst

#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.

36984 06/05/2015 04:09 PM Marek Horst

#1301 skipping transformation when input set to $UNDEFINED$ value

36980 06/05/2015 03:55 PM Marek Horst

#1301 removing redundant schema parameter
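
Revision 37368 (#1315) describes deduplicating concept identifiers by picking the record with the highest confidence level. The following is a minimal Python sketch of that idea only, not the project's actual Pig UDF, which is not shown on this page. The record layout (plain dicts), the `conceptId` key, and the function name `deduplicate_concepts` are assumptions made for illustration; only `confidenceLevel` appears in the commit message.

```python
def deduplicate_concepts(records):
    """Keep one record per concept id, preferring the highest confidence level.

    `records` is an iterable of dicts with the hypothetical keys
    "conceptId" and "confidenceLevel". On a tie, the first record seen wins.
    """
    best = {}
    for record in records:
        key = record["conceptId"]
        current = best.get(key)
        # Replace the stored record only when the new one is strictly more confident.
        if current is None or record["confidenceLevel"] > current["confidenceLevel"]:
            best[key] = record
    return list(best.values())


# Illustrative input: concept "c1" appears twice with different confidence levels.
concepts = [
    {"conceptId": "c1", "confidenceLevel": 0.4},
    {"conceptId": "c2", "confidenceLevel": 0.9},
    {"conceptId": "c1", "confidenceLevel": 0.7},
]
deduplicated = deduplicate_concepts(concepts)
```

Here `deduplicated` holds one record for each of `c1` and `c2`, with the 0.7 record kept for `c1`. Keeping the first record on a tie is an arbitrary choice of this sketch; the commit message does not say how ties are resolved.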
