#1260 enabling document-to-Protein-Data-Bank (PDB) reference extraction in the primary workflow, supporting 3 new parameters: active_referenceextraction_pdb, export_action_set_id_document_pdb, export_referenceextraction_pdb_url_root
#1301 introducing explicit export mode flags: active_export_to_hbase and active_export_to_json. This way both exports can be enabled at once, or both can be disabled.
#1308 switching distcp namespace to uri:oozie:distcp-action:0.2
disabling export by setting the active_export flag to false; results will be converted to JSON records instead
#1301 introducing common/export_to_json and using this subworkflow in both the primary and preprocessing workflows; it is executed when active_export=false, i.e. when HBase export is disabled
#118 explicitly defining the input_document_websiteusage_similarity parameter. This is not a bug fix: the exporter works properly without an explicitly defined input port thanks to propagate-configuration mode, but all input port definitions should be aligned to avoid confusion.
bug fix: adding the removal of the existing fault, which was missing
setting remove_sideproducts to false; otherwise the whole workingDir will be erased
#1257 dropping schema-generation-related hacks in all map-reduce modules, switching to literal schema parameters
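
For illustration only, the explicit export flags from #1301 could be wired into an Oozie workflow roughly as below. This is a minimal sketch, not the project's actual definition: the node names, the workflow name, the `export_to_hbase` subworkflow path, and the location of `common/export_to_json` relative to the application path are all assumptions. Only the flag names active_export_to_hbase and active_export_to_json and the common/export_to_json subworkflow name come from the entries above.

```xml
<!-- Sketch only: node names and sub-workflow paths are hypothetical. -->
<workflow-app xmlns="uri:oozie:workflow:0.4" name="export-flags-sketch">
    <start to="decision-export-to-hbase"/>

    <!-- First flag: run the HBase export only when explicitly enabled. -->
    <decision name="decision-export-to-hbase">
        <switch>
            <case to="export-to-hbase">${wf:conf('active_export_to_hbase') eq 'true'}</case>
            <default to="decision-export-to-json"/>
        </switch>
    </decision>

    <action name="export-to-hbase">
        <sub-workflow>
            <!-- hypothetical path -->
            <app-path>${wf:appPath()}/export_to_hbase</app-path>
            <propagate-configuration/>
        </sub-workflow>
        <ok to="decision-export-to-json"/>
        <error to="fail"/>
    </action>

    <!-- Second flag is checked regardless of the first, so both exports,
         either one, or neither can run. -->
    <decision name="decision-export-to-json">
        <switch>
            <case to="export-to-json">${wf:conf('active_export_to_json') eq 'true'}</case>
            <default to="end"/>
        </switch>
    </decision>

    <action name="export-to-json">
        <sub-workflow>
            <!-- assumed location of the common/export_to_json subworkflow -->
            <app-path>${wf:appPath()}/common/export_to_json</app-path>
            <propagate-configuration/>
        </sub-workflow>
        <ok to="end"/>
        <error to="fail"/>
    </action>

    <kill name="fail">
        <message>Export failed: ${wf:errorMessage(wf:lastErrorNode())}</message>
    </kill>
    <end name="end"/>
</workflow-app>
```

Chaining two decision nodes, rather than using a single switch, is what lets the flags be evaluated separately; a single switch would pick only the first matching case.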