fixed label for hostedby datasources in the query
ActionSet related jobNodes and relative beans moved in dnet-openaireplus-workflows (whenever possible)Added RunMDStorePluginJobNode, will be useful in awhile
first implementation of the importActionsFromHDFS workflow
Let's make the stats generation start after the groupEntities job to avoid OutOfMemory errors in the cluster.
Fixed wrong newline
Updated DOAJ2db mapping, because the source format was not matching the field names looked up by the XSLT.
params for file download configurable via the UI by the user
setting objectstore size in wf env before feeding it
added param sleepTimeMs to the download workflow template
The following parameters for file downloads are now configurable (currently as wf system params): numberOfThreads, connectTimeoutMs, readTimeoutMs
#1868 it is useless to read the response, as the post API returns nothing
#1868 Wf node and test wf to post on a VRE via the Social Networking Library
Enabling incremental collection for entityregistry::projects
Updated description of node
fixed wf name in repo-hi
Changed node name from inference to ingestion
Added new Metawprkflows that copy files from mdstore but in this case change the interpretation of the objectstore
defined single workflow to aggregate contextes and project metadata from the same mdstore (avoid to collect from remote resources twice)
still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC). Perhaps this is a good one
new version of fct xslts
NSF xslts
#1592: fill hosted by workflow assignable to both entity registries and aggregators of pubs repos
#1592: the workflow for the collection of the journal list and the update of the hostedBy map can be assigned to data sources with typology aggregator::pubsrepository
#1592: context workflow only available for project entity registries
reusing sax reader instance
more logging
still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC)
treating missing funding paths as pointers to the funder (NHMRC/ARC)
treating missing funding paths as pointers to the funder (NHMRC)
treating missing funding paths as pointers to the funder (ARC)
Added DropContent Node, changed download node to allow the refresh of the content
Added embargoEndDate field
Unit tests
update to changes in database service api
Fixed malformed query
added "http://" to organization urls if missing, ticket #662
Implemented mime type selection on workflow : Copy Metadata as Files (publications) from PubsRepository [Inference]
Updated node to also set namespacePrefix and datasource id in env attributes whose names are in WorkflowConstants. Old attributes are still set, but the mdbuilder should be updated to take the new attrs.Validation and fill hosted by wf st updated to use the node.
#1844 Transform the SQL queries to the hostedby table in a template
Enabling copy files from mdstore for all repos, not only webcrawl.
added nodes to count moved files from mdstore to objectstore
#1843: EKT journal list for hostedby
more priority to openaire2.0_data on the SQL query for the generation of datasource compatibility
Added projects and contexts
fixed a bug with datasource ids
fixed wrong nampespace imported
replaced function for generation of MD5 in xslt
fixed bug on trasformation of pangaea datasets
Changed MD5 fnuction on DatasetsfromPangaeaIngestion
delete from cache policy
feed claims node
bug fixing
exceptions
fixed typo in sql query
some caches and constraints
partial implementation of FeedMissingClaimsJobNode
added some properties
updated workflow creation dates
fixed mongo db and collection names
fixed a property
implemented an api to index a document
integrating r40247 from 4.1.X branch
fixed descriptions in repo-hi
merging r40366
implementation of the SinglePublicationSubmitter api
#1749#1749: support option also for aggregators of pubsrepositories
#1749: repo-hi for pubsrepository with support interpretation
removed tests for hbase, as they are performed in dnet-mapreduce-jobs instead
Setting the admin email when creating the metaworkflow for downloading pdfs. The email is openaire-fulltexts@openaire.eu
we do not need the t namespace
using openaire2.0_data compliance for datasets
Added jurisdiction to SFI
exception throws added to failing test
using default db name from system property, still overridable on the wf level. Let the patchHostedByNode fail in case of any exception
do not update extra fields for existing hostedBy API
validation wf param jobStatusUpdateInterval set to system param instead of user param, so it does not appear in the UI as requested by Jochen in #539
#539: renamed blacklistGuidelines wf param
updated list of mime type on download workflows
Cleaned template for validation wf.
workflow and metaworkflow to load the hbase table to hdfs, without firing the update of any provision backend.
Ticket #1588 Rename "native" compatibility to "proprietary"
reformatting
added management of oa_mandate_for_publications field
validation occurs before transformation, not in parallel
removed static files of an old UI
renamed validation wf template
Validation as a step of the aggregation workflow.
updated person dedup WF
fixed extra character
moved apply-blacklist workflow location
reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)
implemented management for claim updates #1500
reintroducing queryACM.sql file
blacklist sqls moved to dedicated module
removed stats report
removed white space that prevented thecorrect registration of the meta workflow
remove UI for the stats report visualization, as we are not generating them anymore and the old collected values have been ported to the data flow monitoring.
remove generation of stats report now that we have the data flow monitoring.