# Date Author Comment
42255 15/04/2016 06:16 PM Alessia Bardi

fixed label for hostedby datasources in the query

42244 15/04/2016 06:02 PM Claudio Atzori

ActionSet-related jobNodes and related beans moved to dnet-openaireplus-workflows (whenever possible)
Added RunMDStorePluginJobNode, will be useful in a while

42205 12/04/2016 04:53 PM Claudio Atzori

first implementation of the importActionsFromHDFS workflow

42182 11/04/2016 03:34 PM Alessia Bardi

Let's make the stats generation start after the groupEntities job to avoid OutOfMemory errors in the cluster.

42138 05/04/2016 05:59 PM Alessia Bardi

Fixed wrong newline

42137 05/04/2016 05:58 PM Alessia Bardi

Updated DOAJ2db mapping, because the source format was not matching the field names looked up by the XSLT.

42135 05/04/2016 04:16 PM Alessia Bardi

params for file download configurable via the UI by the user

42118 01/04/2016 06:32 PM Claudio Atzori

setting objectstore size in wf env before feeding it

42080 31/03/2016 03:22 PM Claudio Atzori

added param sleepTimeMs to the download workflow template

41999 30/03/2016 04:54 PM Alessia Bardi

The following parameters for file downloads are now configurable (currently as wf system params): numberOfThreads, connectTimeoutMs, readTimeoutMs
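As a rough illustration of how such parameters might be read from workflow params, here is a minimal Python sketch. Only the three parameter names come from the commit message; the `DownloadParams` class, the `parse_download_params` helper, and the fallback values are hypothetical and are not the project's actual code or defaults.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class DownloadParams:
    """Hypothetical container for the three download settings."""
    number_of_threads: int
    connect_timeout_ms: int
    read_timeout_ms: int


# Assumed fallback values for illustration only, not the project's defaults.
_DEFAULTS = {
    "numberOfThreads": "1",
    "connectTimeoutMs": "5000",
    "readTimeoutMs": "10000",
}


def parse_download_params(wf_params: Mapping[str, str]) -> DownloadParams:
    """Merge workflow params over the fallbacks and validate them."""
    merged = {**_DEFAULTS, **wf_params}
    values = {}
    for name in _DEFAULTS:
        value = int(merged[name])  # raises ValueError on non-numeric input
        if value <= 0:
            raise ValueError(f"{name} must be positive, got {value}")
        values[name] = value
    return DownloadParams(
        number_of_threads=values["numberOfThreads"],
        connect_timeout_ms=values["connectTimeoutMs"],
        read_timeout_ms=values["readTimeoutMs"],
    )
```

Calling `parse_download_params({"numberOfThreads": "4"})` would keep the assumed fallbacks for the two timeouts and override only the thread count.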

41875 24/03/2016 05:59 PM Alessia Bardi

#1868 it is useless to read the response, as the post API returns nothing

41874 24/03/2016 05:55 PM Alessia Bardi

#1868 Wf node and test wf to post on a VRE via the Social Networking Library

41802 22/03/2016 04:58 PM Alessia Bardi

Enabling incremental collection for entityregistry::projects

41801 22/03/2016 04:58 PM Alessia Bardi

Updated description of node

41765 18/03/2016 06:16 PM Alessia Bardi

fixed wf name in repo-hi

41736 18/03/2016 10:49 AM Sandro La Bruzzo

Changed node name from inference to ingestion

41735 18/03/2016 10:48 AM Sandro La Bruzzo

Added new metaworkflows that copy files from mdstore, but in this case change the interpretation of the objectstore

41730 17/03/2016 04:47 PM Claudio Atzori

defined a single workflow to aggregate contexts and project metadata from the same mdstore (avoids collecting from remote resources twice)

41703 16/03/2016 06:20 PM Claudio Atzori

still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC). Perhaps this is a good one

41672 14/03/2016 05:29 PM Michele Artini

new version of fct xslts

41651 10/03/2016 03:58 PM Michele Artini

NSF xslts

41628 09/03/2016 02:29 PM Alessia Bardi

#1592: fill hosted by workflow assignable to both entity registries and aggregators of pubs repos

41623 09/03/2016 12:40 PM Alessia Bardi

#1592: the workflow for the collection of the journal list and the update of the hostedBy map can be assigned to data sources with typology aggregator::pubsrepository

41579 07/03/2016 06:36 PM Alessia Bardi

#1592: context workflow only available for project entity registries

41486 01/03/2016 03:58 PM Claudio Atzori

reusing sax reader instance

41485 01/03/2016 03:46 PM Claudio Atzori

more logging

41483 01/03/2016 03:22 PM Claudio Atzori

still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC)

41479 29/02/2016 04:55 PM Claudio Atzori

treating missing funding paths as pointers to the funder (NHMRC/ARC)

41478 29/02/2016 04:07 PM Claudio Atzori

treating missing funding paths as pointers to the funder (NHMRC)

41476 29/02/2016 03:39 PM Claudio Atzori

treating missing funding paths as pointers to the funder (ARC)

41467 29/02/2016 03:15 PM Sandro La Bruzzo

Added DropContent Node, changed download node to allow the refresh of the content

41394 22/02/2016 12:28 PM Michele Artini

Added embargoEndDate field

41392 22/02/2016 11:46 AM Michele Artini

Unit tests

41371 19/02/2016 02:37 PM Claudio Atzori

update to changes in database service api

41340 17/02/2016 05:39 PM Alessia Bardi

Fixed malformed query

41330 16/02/2016 11:29 AM Michele Artini

added "http://" to organization urls if missing, ticket #662
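The kind of normalization described in this entry can be sketched as follows. This is a minimal Python illustration under assumptions: the function name `ensure_http_scheme` is hypothetical, and the handling of `https://` and of empty values is a guess, since the commit message only says that "http://" is added when missing.

```python
def ensure_http_scheme(url: str) -> str:
    """Prepend 'http://' when the URL has no http(s) scheme.

    Assumption: URLs that already start with http:// or https://
    (case-insensitive) are left untouched, and empty values stay empty.
    """
    cleaned = url.strip()
    if not cleaned:
        return cleaned
    if cleaned.lower().startswith(("http://", "https://")):
        return cleaned
    return "http://" + cleaned
```

For example, `ensure_http_scheme("www.example.org")` yields `http://www.example.org`, while an input that already carries a scheme is returned unchanged.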

41310 15/02/2016 03:07 PM Sandro La Bruzzo

Implemented mime type selection on workflow : Copy Metadata as Files (publications) from PubsRepository [Inference]

41156 03/02/2016 05:56 PM Alessia Bardi

Updated node to also set namespacePrefix and datasource id in env attributes whose names are in WorkflowConstants. Old attributes are still set, but the mdbuilder should be updated to take the new attrs.
Validation and fill hosted by wf st updated to use the node.

41128 01/02/2016 05:19 PM Alessia Bardi

#1844 Transform the SQL queries to the hostedby table in a template

41125 01/02/2016 10:58 AM Alessia Bardi

Enabling copy files from mdstore for all repos, not only webcrawl.

41096 28/01/2016 06:25 PM Alessia Bardi

added nodes to count moved files from mdstore to objectstore

40952 21/01/2016 05:40 PM Alessia Bardi

#1843: EKT journal list for hostedby

40888 19/01/2016 05:02 PM Alessia Bardi

more priority to openaire2.0_data on the SQL query for the generation of datasource compatibility

40859 18/01/2016 04:33 PM Michele Artini

Added projects and contexts

40830 15/01/2016 02:25 PM Michele Artini

fixed a bug with datasource ids

40581 21/12/2015 05:13 PM Sandro La Bruzzo

fixed wrong namespace imported

40577 21/12/2015 04:41 PM Sandro La Bruzzo

fixed wrong namespace imported

40571 21/12/2015 04:15 PM Sandro La Bruzzo

replaced function for generation of MD5 in xslt

40562 21/12/2015 12:40 PM Sandro La Bruzzo

fixed bug in transformation of pangaea datasets

40560 21/12/2015 11:14 AM Sandro La Bruzzo

Changed MD5 function on DatasetsfromPangaeaIngestion

40559 21/12/2015 10:40 AM Michele Artini

delete from cache policy

40491 16/12/2015 04:02 PM Michele Artini

feed claims node

40485 16/12/2015 02:30 PM Michele Artini

bug fixing

40474 16/12/2015 11:25 AM Michele Artini

exceptions

40473 16/12/2015 11:19 AM Claudio Atzori

fixed typo in sql query

40471 16/12/2015 10:18 AM Michele Artini

some caches and constraints

40469 15/12/2015 06:00 PM Michele Artini

partial implementation of FeedMissingClaimsJobNode

40460 15/12/2015 04:27 PM Michele Artini

added some properties

40455 15/12/2015 03:48 PM Alessia Bardi

updated workflow creation dates

40446 15/12/2015 03:24 PM Michele Artini

fixed mongo db and collection names

40444 15/12/2015 03:20 PM Michele Artini

fixed a property

40431 15/12/2015 02:57 PM Michele Artini

implemented an api to index a document

40430 15/12/2015 02:55 PM Claudio Atzori

integrating r40247 from 4.1.X branch

40428 15/12/2015 02:35 PM Claudio Atzori

fixed descriptions in repo-hi

40419 15/12/2015 10:39 AM Claudio Atzori

merging r40366

40398 14/12/2015 05:22 PM Michele Artini

implementation of the SinglePublicationSubmitter api

40362 11/12/2015 03:47 PM Alessia Bardi

#1749: support option also for aggregators of pubsrepositories

40360 11/12/2015 03:38 PM Alessia Bardi

#1749: repo-hi for pubsrepository with support interpretation

40347 11/12/2015 11:41 AM Alessia Bardi

removed tests for hbase, as they are performed in dnet-mapreduce-jobs instead

40344 11/12/2015 11:12 AM Alessia Bardi

Setting the admin email when creating the metaworkflow for downloading pdfs. The email is

40342 11/12/2015 11:10 AM Alessia Bardi

we do not need the t namespace

40310 09/12/2015 05:58 PM Alessia Bardi

using openaire2.0_data compliance for datasets

40144 30/11/2015 04:29 PM Michele Artini

Added jurisdiction to SFI

40117 27/11/2015 02:33 PM Andrea Mannocci

exception throws added to failing test

40104 25/11/2015 06:35 PM Claudio Atzori

more logging

40098 25/11/2015 11:50 AM Claudio Atzori

using default db name from system property, still overridable on the wf level. Let the patchHostedByNode fail in case of any exception

40075 23/11/2015 04:31 PM Alessia Bardi

do not update extra fields for existing hostedBy API

40008 20/11/2015 12:31 PM Alessia Bardi

validation wf param jobStatusUpdateInterval set to system param instead of user param, so it does not appear in the UI as requested by Jochen in #539

40007 20/11/2015 12:28 PM Alessia Bardi

#539: renamed blacklistGuidelines wf param

39991 19/11/2015 10:58 AM Sandro La Bruzzo

updated list of mime type on download workflows

39942 16/11/2015 04:38 PM Alessia Bardi

Cleaned template for validation wf.

39901 13/11/2015 11:10 AM Alessia Bardi

workflow and metaworkflow to load the hbase table to hdfs, without firing the update of any provision backend.

39890 12/11/2015 04:02 PM Alessia Bardi

Ticket #1588 Rename "native" compatibility to "proprietary"

39878 12/11/2015 02:13 PM Michele Artini

reformatting

39877 12/11/2015 02:08 PM Michele Artini

added management of oa_mandate_for_publications field

39864 11/11/2015 02:57 PM Alessia Bardi

validation occurs before transformation, not in parallel

39863 11/11/2015 02:47 PM Michele Artini

removed static files of an old UI

39849 11/11/2015 12:11 PM Alessia Bardi

renamed validation wf template

39846 11/11/2015 11:23 AM Alessia Bardi

Validation as a step of the aggregation workflow.

39839 10/11/2015 04:17 PM Claudio Atzori

updated person dedup WF

39728 28/10/2015 11:18 AM Claudio Atzori

fixed extra character

39720 27/10/2015 05:47 PM Claudio Atzori

moved apply-blacklist workflow location

39562 13/10/2015 04:56 PM Michele Artini

reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)

39543 12/10/2015 12:04 PM Claudio Atzori

implemented management for claim updates #1500

39392 28/09/2015 02:15 PM Claudio Atzori

reintroducing queryACM.sql file

39368 24/09/2015 04:13 PM Alessia Bardi

blacklist sqls moved to dedicated module

39363 24/09/2015 02:12 PM Alessia Bardi

removed stats report

39362 24/09/2015 12:28 PM Alessia Bardi

removed white space that prevented the correct registration of the meta workflow

39361 24/09/2015 10:50 AM Alessia Bardi

remove UI for the stats report visualization, as we are not generating them anymore and the old collected values have been ported to the data flow monitoring.

39360 24/09/2015 10:47 AM Alessia Bardi

remove generation of stats report now that we have the data flow monitoring.