Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  oozie_app 34914 over 9 years Marek Horst #1147 introducing HTML import and HTML plaintex...
README.markdown 566 Bytes about 10 years marek.horst
job.properties 2.37 KB 34893 over 9 years Marek Horst updating job.properties

Latest revisions

# Date Author Comment
34914 27/02/2015 07:34 PM Marek Horst

#1147 introducing HTML import and HTML plaintext ingestion in main workflows: primary and preprocessing

34893 27/02/2015 05:32 PM Marek Horst

updating job.properties

34434 11/02/2015 02:26 PM Marek Horst

#1083 enabling webcrawl ingester module extracting FX field from plaintext before executing project reference extraction

34433 11/02/2015 02:26 PM Marek Horst

updating default job properties

34213 02/02/2015 06:22 PM Marek Horst

#1070 updating import_project_concepts_context_ids_csv default value to "fet-fp7,fet-h2020"

34212 02/02/2015 06:21 PM Marek Horst

#1070 introducing support for multiple context identifiers, replacing import_project_concepts_context_id IIS input parameter with import_project_concepts_context_ids_csv

33184 04/12/2014 04:09 PM Marek Horst

#919 enabling concepts matching for FET projects in mainworkflows: import, export, primary and preprocessing

33098 28/11/2014 04:27 PM Marek Horst

#1022 introducing extracted document metadata collapser at importing phase.
Propagating extracted document mentadata (including PMC ingested metadata) to processing part of workflow what can be exploited by citation matching module.
Introducing citations collapser in last stage of processing phase collapsing ingested citations with matched citations.

32829 17/11/2014 03:45 PM Marek Horst

#963 propagating dataset -> mdstore from import to exporting phase: importer produces DocumentToMDStore datasetore utilized by exporter module. Updating transformer definition to handle DocumentToMDStore instead of Identifier schema

32823 17/11/2014 03:42 PM Marek Horst

updating job.properties

View revisions

Also available in: Atom