fixed wf name in repo-hi
Changed node name from inference to ingestion
Added new Metawprkflows that copy files from mdstore but in this case change the interpretation of the objectstore
defined single workflow to aggregate contextes and project metadata from the same mdstore (avoid to collect from remote resources twice)
#1592: fill hosted by workflow assignable to both entity registries and aggregators of pubs repos
#1592: the workflow for the collection of the journal list and the update of the hostedBy map can be assigned to data sources with typology aggregator::pubsrepository
#1592: context workflow only available for project entity registries
Enabling copy files from mdstore for all repos, not only webcrawl.
feed claims node
updated workflow creation dates
integrating r40247 from 4.1.X branch
fixed descriptions in repo-hi
#1749#1749: support option also for aggregators of pubsrepositories
#1749: repo-hi for pubsrepository with support interpretation
Setting the admin email when creating the metaworkflow for downloading pdfs. The email is openaire-fulltexts@openaire.eu
workflow and metaworkflow to load the hbase table to hdfs, without firing the update of any provision backend.
Validation as a step of the aggregation workflow.
updated person dedup WF
fixed extra character
moved apply-blacklist workflow location
reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)
implemented management for claim updates #1500
removed white space that prevented thecorrect registration of the meta workflow
remove generation of stats report now that we have the data flow monitoring.
added import information space workflow
added information space export job
Pubs Aggregator workflows can be applied also to pubs catalogues (#1453).
including claimed publication dataset relationships
profiles to run calculate Person Distribution
updated primary iis job profile and workflow to the latest specs
reintroduced basic person dedup workflow
don't close the mesh for persons
using the correct "apply actions" workflow
moved to dnet-deduplication
integrated branch dnet-deduplication
The shadow stats validation report must occur after the shadow search switch.
actions garbage workflow migrated to dnet-deduplication
added selective db2hbase workflow
integrated changes from r35921 of branch 3.0.8.x
Added a first implementation of data flow monitoring
fixed arc
fixed env param name
fixed arcs
Nodes to check that the number of records read from mdstores is the same as the number of results written to HBASE
merge from fundingPaths
wf update to support the exclusion of persons and duplicate records during the OAI feed. Ported from 2.2.1 branch
updated DM meta to include the similarity loading wf
added similarity relationships management: UI, data load workflow
Update wf node description because we are paranoid.
merged rev 34244 from branch 2.2.1: Refactored StatsJobNode to include the new "migrate cache" action, which has been integrated in the prepare public stats wf.
Merged changes from 2.x branch regarding the stats report visualization in UI.
added step to automatically ask for the stats report. Profiles already updated also on dev, beta, and production.
Added repo-hi for typology Datarepository inference
Updated profile creation dates.
fixed action set json structures
MERGE dnet-openaireplus-workflows statsManager branch [33559]:[33654] into trunk
added isLookup endpoint and contextid params to IIS workflows
Added 2 new meta-wf to request the backup and restore actions to the StatsManagerService. Waiting for #914 to understand where exactly in the existing workflows the backup node can be put.
repo-hi file renaming, log set to debug
fixed wrong path, updated resource_identifier
bug fixed
Updated workflow name
Renaming repo-hi
Removed unused workflow "update-repository-size"
implemented pangea by journal workflow
implemented "simulation" mode for hadoop jobs,refactored enable/disable inference modules in IIS main workflow
provision wf fetches the stats conf from the relative hadoopJobProfile
stats conf moved in the resp. HadoopJobConfiguration profile.
implemented hostdby map on DOAj
we need a start node.
Workflows for stats validation and cache refresh added to the stats metaworkflow
findIndex is not a start node
damn peer.adr
refactored copyTable and resetHBaseTable workflows
Added node description.
Added content publishing workflow to automatize the switch of the index and stats shown by the openaire+ portal.
Fixed provision workflow: when the findSearchService node fails it must not go in the sync node waitAll.
changed identifier because collides with another profile
Added Pangaea Workflow datasets by projects
updated expected compliance on download workflow
wf to harvest datacite sets
added objectStoreBlacklistCSV to wf definition
refactored submit job nodes, introducing AdminJobNodes
added doaj as entity registry wf
added waitAll node
added copytable workflow
added workflow to export datasets directly on HBASE (zenodo)
added workflow shortcut to avoid exporting all the md record on hdfs, allows to reuse the intermediate data stored during the previous run
links to external editors in wf details pages
Removed dedup meta workflow that re-use workflows used in the dm metaworkflow. If different metaworkflows re-use the same wfs, those wfs are executed multiple times.
updated layout for the mdstore holding claimed records
derp indentation
renamed wf
updated wf for dataset records
Removed metaworkflow to avoid undesired multiple execution of workflows triggered by the current implementation of msro based on issn notification.
added descriptions
updated wf definition
fixed wf params