Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  avro2json 36984 about 9 years Marek Horst #1301 skipping transformation when input set to...
  citationmatching 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  common 36455 about 9 years Marek Horst bugfix: adding missing start element
  documentsclassification 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  documentssimilarity 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  documentssimilarity_with_fulltext 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  export 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  idreplacer 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  importer 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  ingest 37347 about 9 years Marek Horst #1329 adding affiliations field in ExtractedDoc...
  metadataextraction 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  metadatamerger 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  metricsprimary 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  referenceextraction 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  statistics 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...
  websiteusage 36306 about 9 years Marek Horst #1257 dropping schema generation related hacks ...

Latest revisions

# Date Author Comment
37347 20/05/2015 06:49 PM Marek Horst

#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.

36984 06/05/2015 04:09 PM Marek Horst

#1301 skipping transformation when input set to $UNDEFINED$ value

36980 06/05/2015 03:55 PM Marek Horst

#1301 removing redundant schema parameter

36970 06/05/2015 03:08 PM Marek Horst

#1301 introducing generic avro to json transformer

36455 17/04/2015 05:52 PM Marek Horst

bugfix: adding missing start element

36306 10/04/2015 01:03 PM Marek Horst

#1257 dropping schema generation related hacks in all PIG modules, switching to literal schema parameters

35517 19/03/2015 05:59 PM Marek Horst

#1210 introducing generic PIG module filtering inferred data by confidence level

35228 11/03/2015 01:14 PM Marek Horst

#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072

35151 06/03/2015 05:34 PM Marek Horst

introducing repetetive ordering of citations by ordering them by citation rawText

34993 03/03/2015 02:36 PM Marek Horst

#1169 fixing duplicate context issue, introducing integration test proving implemented solution works properly

View revisions

Also available in: Atom