first implementation of the importActionsFromHDFS workflow
Let's make the stats generation start after the groupEntities job to avoid OutOfMemory errors in the cluster.
Fixed wrong newline
Updated DOAJ2db mapping, because the source format was not matching the field names looked up by the XSLT.
params for file download configurable via the UI by the user
setting objectstore size in wf env before feeding it
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.2.3
added param sleepTimeMs to the download workflow template
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.2.2
depending on newer version of database service api
The following parameters for file downloads are now configurable (currently as wf system params): numberOfThreads, connectTimeoutMs, readTimeoutMs
#1868 it is useless to read the response, as the post API returns nothing
#1868 Wf node and test wf to post on a VRE via the Social Networking Library
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.2.1
Enabling incremental collection for entityregistry::projects
Updated description of node
fixed wf name in repo-hi
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.2.0
bumped versopm
Changed node name from inference to ingestion
Added new Metawprkflows that copy files from mdstore but in this case change the interpretation of the objectstore
defined single workflow to aggregate contextes and project metadata from the same mdstore (avoid to collect from remote resources twice)
still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC). Perhaps this is a good one
new version of fct xslts
NSF xslts
#1592: fill hosted by workflow assignable to both entity registries and aggregators of pubs repos
#1592: the workflow for the collection of the journal list and the update of the hostedBy map can be assigned to data sources with typology aggregator::pubsrepository
#1592: context workflow only available for project entity registries
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.1.2
reusing sax reader instance
more logging
bumped minor version, still trying to address the issue in the previous commit msg
still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC)
treating missing funding paths as pointers to the funder (NHMRC/ARC)
treating missing funding paths as pointers to the funder (NHMRC)
treating missing funding paths as pointers to the funder (ARC)
Added DropContent Node, changed download node to allow the refresh of the content
fixed the version of cnr-enabling-database-api
removed snapshot dependencies
Added embargoEndDate field
Unit tests
update to changes in database service api
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.13
Fixed malformed query
added "http://" to organization urls if missing, ticket #662
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.12
Implemented mime type selection on workflow : Copy Metadata as Files (publications) from PubsRepository [Inference]
Updated node to also set namespacePrefix and datasource id in env attributes whose names are in WorkflowConstants. Old attributes are still set, but the mdbuilder should be updated to take the new attrs.Validation and fill hosted by wf st updated to use the node.
#1844 Transform the SQL queries to the hostedby table in a template
Enabling copy files from mdstore for all repos, not only webcrawl.
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.11
added nodes to count moved files from mdstore to objectstore
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.10
#1843: EKT journal list for hostedby
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.9
more priority to openaire2.0_data on the SQL query for the generation of datasource compatibility
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.8
Added projects and contexts
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.7
fixed a bug with datasource ids
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.6
fixed wrong nampespace imported
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.5
replaced function for generation of MD5 in xslt
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.4
fixed bug on trasformation of pangaea datasets
Changed MD5 fnuction on DatasetsfromPangaeaIngestion
delete from cache policy
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.3
feed claims node
bug fixing
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.2
ignoring log file
exceptions
fixed typo in sql query
some caches and constraints
partial implementation of FeedMissingClaimsJobNode
added some properties
updated workflow creation dates
[maven-release-plugin] prepare release dnet-openaireplus-workflows-5.0.1