Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  java 37368 about 9 years Marek Horst #1315 propagating confidenceLevel to DocumentTo...
  resources 37368 about 9 years Marek Horst #1315 propagating confidenceLevel to DocumentTo...

Latest revisions

# Date Author Comment
37368 21/05/2015 06:26 PM Marek Horst

#1315 propagating confidenceLevel to DocumentToConceptIds. Updating PIG transformer script by introducing concept identifiers deduplication UDF function picking record with the highest confidence level, introducing unit and integration tests. Propagating changes in document to concepts exporter module.

37347 20/05/2015 06:49 PM Marek Horst

#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.

36984 06/05/2015 04:09 PM Marek Horst

#1301 skipping transformation when input set to $UNDEFINED$ value

36980 06/05/2015 03:55 PM Marek Horst

#1301 removing redundant schema parameter

36970 06/05/2015 03:08 PM Marek Horst

#1301 introducing generic avro to json transformer

36455 17/04/2015 05:52 PM Marek Horst

bugfix: adding missing start element

36306 10/04/2015 01:03 PM Marek Horst

#1257 dropping schema generation related hacks in all PIG modules, switching to literal schema parameters

35517 19/03/2015 05:59 PM Marek Horst

#1210 introducing generic PIG module filtering inferred data by confidence level

35228 11/03/2015 01:14 PM Marek Horst

#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072

35151 06/03/2015 05:34 PM Marek Horst

introducing repetetive ordering of citations by ordering them by citation rawText

View revisions

Also available in: Atom