Name  Size  Revision  Age             Author       Comment
main        39045     almost 9 years  Marek Horst  #1498 introducing major citations related refac...
test        39086     almost 9 years  Marek Horst  renaming test resources to be compliant with wi...

Latest revisions

# Date Author Comment
39086 08/09/2015 01:43 PM Marek Horst

renaming test resources to be compliant with windows file system naming requirements

39045 04/09/2015 11:26 PM Marek Horst

#1498 introducing major citations-related refactoring, including new generic direct citation matching moved to the processing phase; introduced position field in all citation schemas and updated collapser to take position into account when merging citation details coming from 3 various sources: fuzzy citationmatching, direct citationmatching, references metadata

38183 13/07/2015 01:37 PM Marek Horst

#1435 making PMC XML parser less strict in terms of expected input elements or attributes: article-type is set to 'unknown' value when attribute not defined in XML main element

38172 13/07/2015 11:30 AM Marek Horst

#1431 fixing PMC XML records parser disallowing null reference type, reference value will be simply omitted

37875 19/06/2015 02:10 PM Marek Horst

#1381 porting pmc citations ingestion from cascading framework to pig. Moving code from icm-iis-ingest-pmc to icm-iis-transformers including integration tests, removing obsolete scala code along with unneeded dependencies. Switching subworkflow in primary workflow.

37830 17/06/2015 09:57 PM Marek Horst

updating job.properties

37813 16/06/2015 02:05 PM Marek Horst

#1370 making pmc ingestion integration tests run on dedicated test cluster instead of embedded mini-oozie container

37356 21/05/2015 12:35 PM Marek Horst

#1329 setting affiliation string as raw text if parser produced empty Element object

37344 20/05/2015 06:49 PM Marek Horst

#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.

36291 09/04/2015 07:10 PM Marek Horst

#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters
