using layout=store for claims aggregation workflow. This will make them to be loaded by the existing procedure.
#2214: funder original name all in low-case letters
adjusting parameters for IIS preprocessing workflow V2
NWO transformation sets jsonextrainfo for IIS
#2293: filling db field jsonextrainfo
refinements in the cache builder workflow
#2293: JSON type is not yet supported by the DB service, so let's stick to TEXT for now.
#2293: jsonextrainfo field in projects for inference
introduced iisCacheBuilder and CDH5 specific inference workflows
deprecating IIS main and preprocessing parameter preparation job nodes
fixed typo
introducing IIS cache builder Job
#2058 and in general. The funding path of funders must also start with <fundintree>, and not directly with <funder>, otherwise the final XML is built in the wrong way.
#2055: updated mapping for HR projects to link to the funder
#2055: I do not know why we must introduce fake funding_level_0. We know how to handle funding paths with only the funders. Let's do it, also for CSF
#2055: I do not know why we must introduce fake funding_level_0. We know how to handle funding paths with only the funders. Let's do it.
#2058: NSF projects might be with no funding paths. Now we always create a "blank" funding path that contains only the funder information, as it was done for ARC.
Revert to previous version of fill hosted by map
added optional1 and optional2 to map project extra information (#2017)
Mapping NWO dossier and gebied into optional1 and optional2 project fields to meet inference requirements #2017#note-20
#2209: H2020 substreams based on type of actions: added new mappings for contexts and projects.
NSF #2058: If there is no funding element at all, let's add a funding path that contains only the funder.
[broker] workflow profile
One Croatian project has no title, let's use the code if no title is found.
Croatian funders xslt updated: new funder field <originalName> and exploitation of the funder acronym in the transformation rule.
Added node to optionally set the code of the funder to be used in the ApplyXSLT step of the generateContext workflow. Checking for empty fundng trees in ContextUtils.
Handling journal datasources with EISSN instead of ISSN
Added new workflow for generic fill hosted by map
Do not map subjects
No smenatic class and scheme for subject and projects collected from NIH.
Do not create funding for empty IC_NAME
Excluding persons from NIH mapping
NIH support
indent
added mapping for NWO
Adapting existing workflows to use the new multiple switch index job nodes.
It is useless to perform an xquery by id to get the id of a profile: now using 'search_service_ID' env parameter
more logs on nodes that switch the index used by search services
We can have multiple shadow/public Search service: introducing job nodes and workflows to handle them automatically when switching index.
added deleteObject api
static methods to calculate the objId
add method to remove entries
added "type" attribute (valid values: publication, dataset)
#2091: dataset claims collected separetely from publications
#1580: connectTimeoutMs and readTimeoutMs default values set to 60000 instead of 0.
re3data updated db should not fail for repos without a base url: if this is the case now we set the fake URL: http://unavailable.base.url
Validation off by default.
added job node bean for backward compatibility
link to the funding path that only contains the funder info, in cases when directorate and divisions are not specified.
Fixed handling of date formats on WT projects
New mapping for Wellcome trust context. To be tested.
New mapping for Wellcome trust. To be tested.
Added node to trigger monitoring on the index only
#1844#note-7: tubitac xslt renamed with a more generic name, as it can be re-used in workflows for different publishers as well (journaltitles2db.xsl)
some fix in claims management
lowcase uuid for profile
fixed label for hostedby datasources in the query
ActionSet related jobNodes and relative beans moved in dnet-openaireplus-workflows (whenever possible)Added RunMDStorePluginJobNode, will be useful in awhile
first implementation of the importActionsFromHDFS workflow
Let's make the stats generation start after the groupEntities job to avoid OutOfMemory errors in the cluster.
Fixed wrong newline
Updated DOAJ2db mapping, because the source format was not matching the field names looked up by the XSLT.
params for file download configurable via the UI by the user
setting objectstore size in wf env before feeding it
added param sleepTimeMs to the download workflow template
The following parameters for file downloads are now configurable (currently as wf system params): numberOfThreads, connectTimeoutMs, readTimeoutMs
#1868 it is useless to read the response, as the post API returns nothing
#1868 Wf node and test wf to post on a VRE via the Social Networking Library
Enabling incremental collection for entityregistry::projects
Updated description of node
fixed wf name in repo-hi
Changed node name from inference to ingestion
Added new Metawprkflows that copy files from mdstore but in this case change the interpretation of the objectstore
defined single workflow to aggregate contextes and project metadata from the same mdstore (avoid to collect from remote resources twice)
still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC). Perhaps this is a good one
new version of fct xslts
NSF xslts
#1592: fill hosted by workflow assignable to both entity registries and aggregators of pubs repos
#1592: the workflow for the collection of the journal list and the update of the hostedBy map can be assigned to data sources with typology aggregator::pubsrepository
#1592: context workflow only available for project entity registries
reusing sax reader instance
more logging
still trying to treat missing funding paths as pointers to the funder (NHMRC/ARC)
treating missing funding paths as pointers to the funder (NHMRC/ARC)
treating missing funding paths as pointers to the funder (NHMRC)
treating missing funding paths as pointers to the funder (ARC)
Added DropContent Node, changed download node to allow the refresh of the content
Added embargoEndDate field
Unit tests
update to changes in database service api
Fixed malformed query
added "http://" to organization urls if missing, ticket #662
Implemented mime type selection on workflow : Copy Metadata as Files (publications) from PubsRepository [Inference]
Updated node to also set namespacePrefix and datasource id in env attributes whose names are in WorkflowConstants. Old attributes are still set, but the mdbuilder should be updated to take the new attrs.Validation and fill hosted by wf st updated to use the node.
#1844 Transform the SQL queries to the hostedby table in a template
Enabling copy files from mdstore for all repos, not only webcrawl.