Project

General

Profile

Statistics
| Revision:

# Date Author Comment
41765 18/03/2016 06:16 PM Alessia Bardi

fixed wf name in repo-hi

41736 18/03/2016 10:49 AM Sandro La Bruzzo

Changed node name from inference to ingestion

41735 18/03/2016 10:48 AM Sandro La Bruzzo

Added new Metawprkflows that copy files from mdstore but in this case change the interpretation of the objectstore

41730 17/03/2016 04:47 PM Claudio Atzori

defined single workflow to aggregate contextes and project metadata from the same mdstore (avoid to collect from remote resources twice)

41628 09/03/2016 02:29 PM Alessia Bardi

#1592: fill hosted by workflow assignable to both entity registries and aggregators of pubs repos

41623 09/03/2016 12:40 PM Alessia Bardi

#1592: the workflow for the collection of the journal list and the update of the hostedBy map can be assigned to data sources with typology aggregator::pubsrepository

41579 07/03/2016 06:36 PM Alessia Bardi

#1592: context workflow only available for project entity registries

41125 01/02/2016 10:58 AM Alessia Bardi

Enabling copy files from mdstore for all repos, not only webcrawl.

40491 16/12/2015 04:02 PM Michele Artini

feed claims node

40455 15/12/2015 03:48 PM Alessia Bardi

updated workflow creation dates

40430 15/12/2015 02:55 PM Claudio Atzori

integrating r40247 from 4.1.X branch

40428 15/12/2015 02:35 PM Claudio Atzori

fixed descriptions in repo-hi

40362 11/12/2015 03:47 PM Alessia Bardi

#1749#1749: support option also for aggregators of pubsrepositories

40360 11/12/2015 03:38 PM Alessia Bardi

#1749: repo-hi for pubsrepository with support interpretation

40344 11/12/2015 11:12 AM Alessia Bardi

Setting the admin email when creating the metaworkflow for downloading pdfs. The email is

39901 13/11/2015 11:10 AM Alessia Bardi

workflow and metaworkflow to load the hbase table to hdfs, without firing the update of any provision backend.

39846 11/11/2015 11:23 AM Alessia Bardi

Validation as a step of the aggregation workflow.

39839 10/11/2015 04:17 PM Claudio Atzori

updated person dedup WF

39728 28/10/2015 11:18 AM Claudio Atzori

fixed extra character

39720 27/10/2015 05:47 PM Claudio Atzori

moved apply-blacklist workflow location

39562 13/10/2015 04:56 PM Michele Artini

reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)

39543 12/10/2015 12:04 PM Claudio Atzori

implemented management for claim updates #1500

39362 24/09/2015 12:28 PM Alessia Bardi

removed white space that prevented thecorrect registration of the meta workflow

39360 24/09/2015 10:47 AM Alessia Bardi

remove generation of stats report now that we have the data flow monitoring.

39302 18/09/2015 05:31 PM Claudio Atzori

added import information space workflow

39222 14/09/2015 06:06 PM Claudio Atzori

added information space export job

38941 01/09/2015 04:07 PM Alessia Bardi

Pubs Aggregator workflows can be applied also to pubs catalogues (#1453).

38825 28/08/2015 10:12 AM Claudio Atzori

including claimed publication dataset relationships

37779 15/06/2015 11:41 AM Michele Artini

profiles to run calculate Person Distribution

37649 05/06/2015 06:51 PM Claudio Atzori

updated primary iis job profile and workflow to the latest specs

37501 27/05/2015 11:32 AM Claudio Atzori

reintroduced basic person dedup workflow

37421 22/05/2015 05:46 PM Claudio Atzori

don't close the mesh for persons

37244 14/05/2015 02:51 PM Claudio Atzori

using the correct "apply actions" workflow

37237 14/05/2015 12:19 PM Claudio Atzori

moved to dnet-deduplication

37200 13/05/2015 02:29 PM Claudio Atzori

integrated branch dnet-deduplication

36711 24/04/2015 05:01 PM Alessia Bardi

The shadow stats validation report must occur after the shadow search switch.

36648 23/04/2015 01:11 PM Claudio Atzori

actions garbage workflow migrated to dnet-deduplication

36301 10/04/2015 11:40 AM Claudio Atzori

added selective db2hbase workflow

36274 09/04/2015 05:15 PM Alessia Bardi

integrated changes from r35921 of branch 3.0.8.x

36217 08/04/2015 06:03 PM Andrea Mannocci

Added a first implementation of data flow monitoring

35857 31/03/2015 04:00 PM Alessia Bardi

fixed arc

35853 31/03/2015 03:26 PM Alessia Bardi

fixed env param name

35830 30/03/2015 06:52 PM Alessia Bardi

fixed arcs

35829 30/03/2015 06:51 PM Alessia Bardi

Nodes to check that the number of records read from mdstores is the same as the number of results written to HBASE

35176 09/03/2015 02:37 PM Michele Artini

merge from fundingPaths

34397 10/02/2015 12:51 PM Alessia Bardi

wf update to support the exclusion of persons and duplicate records during the OAI feed. Ported from 2.2.1 branch

34381 10/02/2015 10:41 AM Claudio Atzori

updated DM meta to include the similarity loading wf

34379 10/02/2015 10:34 AM Claudio Atzori

added similarity relationships management: UI, data load workflow

34278 04/02/2015 07:08 PM Alessia Bardi

Update wf node description because we are paranoid.

34245 03/02/2015 04:07 PM Alessia Bardi

merged rev 34244 from branch 2.2.1: Refactored StatsJobNode to include the new "migrate cache" action, which has been integrated in the prepare public stats wf.

34063 22/01/2015 07:12 PM Alessia Bardi

Merged changes from 2.x branch regarding the stats report visualization in UI.

34009 20/01/2015 06:24 PM Alessia Bardi

added step to automatically ask for the stats report. Profiles already updated also on dev, beta, and production.

33947 16/01/2015 11:09 AM Sandro La Bruzzo

Added repo-hi for typology Datarepository inference

33704 22/12/2014 12:46 PM Alessia Bardi

Updated profile creation dates.

33663 17/12/2014 06:52 PM Claudio Atzori

fixed action set json structures

33655 17/12/2014 03:47 PM Alessia Bardi

MERGE dnet-openaireplus-workflows statsManager branch [33559]:[33654] into trunk

33390 15/12/2014 10:49 AM Claudio Atzori

added isLookup endpoint and contextid params to IIS workflows

33378 12/12/2014 05:30 PM Alessia Bardi

Added 2 new meta-wf to request the backup and restore actions to the StatsManagerService. Waiting for #914 to understand where exactly in the existing workflows the backup node can be put.

33109 01/12/2014 11:50 AM Sandro La Bruzzo

repo-hi file renaming, log set to debug

33075 28/11/2014 12:45 PM Sandro La Bruzzo

fixed wrong path, updated resource_identifier

33045 27/11/2014 05:04 PM Sandro La Bruzzo

bug fixed

33028 27/11/2014 02:55 PM Alessia Bardi

Updated workflow name

33011 27/11/2014 12:55 PM Sandro La Bruzzo

Renaming repo-hi

32964 25/11/2014 10:15 AM Alessia Bardi

Removed unused workflow "update-repository-size"

32940 21/11/2014 05:16 PM Sandro La Bruzzo

implemented pangea by journal workflow

32863 18/11/2014 04:32 PM Claudio Atzori

implemented "simulation" mode for hadoop jobs,
refactored enable/disable inference modules in IIS main workflow

32842 17/11/2014 06:37 PM Claudio Atzori

provision wf fetches the stats conf from the relative hadoopJobProfile

32841 17/11/2014 06:35 PM Claudio Atzori

stats conf moved in the resp. HadoopJobConfiguration profile.

32838 17/11/2014 06:02 PM Sandro La Bruzzo

implemented hostdby map on DOAj

32323 07/11/2014 02:41 PM Alessia Bardi

we need a start node.

32322 07/11/2014 02:00 PM Alessia Bardi

Workflows for stats validation and cache refresh added to the stats metaworkflow

32254 05/11/2014 06:56 PM Alessia Bardi

findIndex is not a start node

32208 05/11/2014 12:30 PM Claudio Atzori

damn peer.adr

32183 04/11/2014 07:54 PM Claudio Atzori

refactored copyTable and resetHBaseTable workflows

32181 04/11/2014 07:47 PM Alessia Bardi

Added node description.

32177 04/11/2014 06:41 PM Alessia Bardi

Added content publishing workflow to automatize the switch of the index and stats shown by the openaire+ portal.

32029 31/10/2014 11:44 AM Alessia Bardi

Fixed provision workflow: when the findSearchService node fails it must not go in the sync node waitAll.

31937 30/10/2014 09:45 AM Sandro La Bruzzo

changed identifier because collides with another profile

31924 29/10/2014 06:54 PM Sandro La Bruzzo

Added Pangaea Workflow datasets by projects

31824 28/10/2014 12:46 PM Sandro La Bruzzo

updated expected compliance on download workflow

31490 20/10/2014 04:16 PM Michele Artini

wf to harvest datacite sets

31431 17/10/2014 05:08 PM Claudio Atzori

added objectStoreBlacklistCSV to wf definition

31415 17/10/2014 11:53 AM Claudio Atzori

refactored submit job nodes, introducing AdminJobNodes

31284 13/10/2014 12:12 PM Sandro La Bruzzo

added doaj as entity registry wf

31260 10/10/2014 10:17 AM Claudio Atzori

added waitAll node

31213 08/10/2014 05:31 PM Claudio Atzori

added copytable workflow

31072 03/10/2014 09:43 AM Claudio Atzori

added workflow to export datasets directly on HBASE (zenodo)

30948 30/09/2014 04:51 PM Claudio Atzori

added workflow shortcut to avoid exporting all the md record on hdfs, allows to reuse the intermediate data stored during the previous run

30934 29/09/2014 05:34 PM Michele Artini

links to external editors in wf details pages

30844 24/09/2014 03:54 PM Alessia Bardi

Removed dedup meta workflow that re-use workflows used in the dm metaworkflow. If different metaworkflows re-use the same wfs, those wfs are executed multiple times.

30841 24/09/2014 12:32 PM Claudio Atzori

updated layout for the mdstore holding claimed records

30810 22/09/2014 01:11 PM Claudio Atzori

derp indentation

30809 22/09/2014 01:00 PM Claudio Atzori

renamed wf

30808 22/09/2014 12:59 PM Claudio Atzori

updated wf for dataset records

30792 19/09/2014 03:48 PM Alessia Bardi

Removed metaworkflow to avoid undesired multiple execution of workflows triggered by the current implementation of msro based on issn notification.

30785 19/09/2014 12:03 PM Claudio Atzori

added descriptions

30781 19/09/2014 10:35 AM Claudio Atzori

updated wf definition

30717 18/09/2014 12:26 PM Claudio Atzori

fixed wf params