default Hadoop cluster set to IIS, fixed typo
default Hadoop cluster set to IIS
allow to set untilDateOverride in hadoop-specific collection workflows
updated hadoop aggregation workflow reporting
we lost the information for one eventual level 2 in the funding stream with the data we get from the api
context for new funder chistera and changed name for old funders RCUK (now UKRI) and GSRT (now GSRI)
changed to consider funding stream to level two
updated hadoop-specific aggregation workflows
updated hadoop-specific collection workflow parameters: allow to configure the http client behaviour
Hadoop-based collection workflow: capture oozie output params
hadoop based transformation workflow
new context creation for UKRI
context for ANR projects
No UNKNOWN funding streams for FP7
#5297: mappings for the European Environment Agency
#4993: sql to manually add entry in hostedby map
merged with branch dnet-hadoop
#4852: sysimport:crosswalk is enough
changed the original name with the English name
fixed wrong number of chars in namespaceprefix
changed the context for considering the founding stream and the change in the name of the funder
Russian Science Foundation context
Cleaned up namespaces and handle objIdentifier linke in dnet50 mdbuilder.xslt.st
Normalize space of metadata identifier path as it is done in "normal" mdBuilder
Validation step also for aggregator of data repository, as requested in #4297
Added cris and guidelines 4 available for continous validation
reintegrated branch solr75 -r53774:HEAD
context for new funder rpf
indentation
included node filterManaged in transformWithTDS.wf
of course its typology also in the SELECT
datasourceclass column was renamed to typology
Fixed query for Datacite sets
deleted deprecated xslt for corda h2020
Fixed description node
Fixed template for funder database: there was still an arc pointing to the unexisting setFunderAcronym node
trying to fix issues with H2020 and tara project links
Node to set the funder acronym for transformation is not usable together with TDS and transformator service. The funder acronym must be cabled into the transformation rules. For Croatian projects we must create 2 distinct TDS
Allow assignment of INGEST wf to aggregator of data repos
fixed wf template for datacite OAI sets
removed old transformation for DOAJ journal list
Mappings for re3data and opendoar are now TDS rules and not anymore xslt in the classpath
Removed old XSLT for doaj article titles: the relative TDS rule must be used instead.
Mapping for journaltitles and irdb are now TDS rules and not anymore xslt in the classpath
Incremental transformation for Datacite OAI sets
Deleted old xslt: now that we have incremental transformation we have to use TDS rules
fixed wf template
context for miur funder
Funder wfs now use xslt as TDS rule for transformations
using dsm_datasources in sql expressions
using dsm_datasources
refactoring aka2db
fixed arc target node name
To support incremental update of projects in the database, useful functionality of IncrementalTransformationJobNode has been extracted to a superclass. Wf template for entity registries has been updated accordingly. Wf templates for other types of ds that were already in incremental mode have been updated for the new param name inherited from the superclass.
RCUK: handle cases when the /funder/name is blank
Incremental transformation also for entity registries
Added wf parameter for dashboard visibility
merged incremental transformation to trunk
Re-introduced <FIELD name="_dnet_resource_identifier_">: it is required if we want to enable the updates on the database.
fixing mappings for datasource subjects, languages, contenttypes
using dsm_datasources instead of datasources table, fixed datasource data migration rules for languages and od_contenttypes
modified context and transformation for Academy of Finland in accord with #3119#note-5
removed column 'trust' from re3data mapping
fixed params in datasource entity reg aggregation workflow
merged branch dsm into trunk
Adjusted trust levels to give priorities to organizations from re3data and OpenDOAR rather than EC databases. Among corda and corda h2020, the latter wins.
Fixes #3313: record trust is now a user parameter.
Pangaea dataet: identifiers must be plain DOI (not URL) or the portal generates a wrong link.
change the funderId from DFGF to DFG
re-introduced persons for prod. New file created with the no-person mapping for h2020
revert previous commit: opendoar was not affected
ticket #2955: ensure entity registry set the proper provenanceactionclass for the entity they create
transformation for dfg funder
minor modification
context for dfg funder
XSLTs for pangaea removed from the datacite folder. They must be only in their dedicate folder of the classpath.For the pangaea by project, the format of the id slightly changed and the mappings have been changed accordingly
commented new field contactfullname in table projects. To remove the comment when new db version in beta
refactoring
context for Academy of Finland funder
transformation for Academy of Finland funder
transformation for sgov funder
trasformation for sgov context generation
more accurate mapping for contactfullname
cleanup
small changes
avoid to produce empty xml records when a 'canceled' project is found, it breaks the workflow. Fixed also the organization identifier costruction and the separator character used to parse column 'project number'
normalise spaces when extract project contact info
project contact person moved in project's description
fixed mapping
getting rid of person entities #2893