updated hadoop-specific aggregation workflows
new context creation for UKRI
Query updated to ignore funders that are not ready for production, but have been aggregated into production
context for ANR projects
No UNKNOWN funding streams for FP7
Added OpenAIRE 4.0 compatibility
#5297: mappings for the European Environment Agency
Including currency
query to import OpenOrgs without acronyms
#4993: sql to manually add entry in hostedby map
sql queries for open orgs similarities
(openorgs) added schemeid to pids in sql query
temporary test
Import of OpenOrgs Organizations
mereged transformation Job spark
#4967: change for datasources
merged with branch dnet-hadoop
#4852: sysimport:crosswalk is enough
changed the original name with the English name
fixed wrong number of chars in namespaceprefix
changed the context for considering the founding stream and the change in the name of the funder
Russian Science Foundation context
fixes #4865: do not export OAI PMH URLs that are not used
Cleaned up namespaces and handle objIdentifier linke in dnet50 mdbuilder.xslt.st
Normalize space of metadata identifier path as it is done in "normal" mdBuilder
fixed name of node in bean
added bean for propagation of community result through semantic relation
commented in the applicationContext all the hadoop implementation specific classes
MERGE dnet-openaireplus-workflows dnet-hadoop 56022:56364 into trunk
Validation step also for aggregator of data repository, as requested in #4297
Added cris and guidelines 4 available for continous validation
Updates for Selection Criteria implementation
get journal fields from the database
reintegrated branch solr75 -r53774:HEAD
fixed typo querying datasources
distinct PIDs for datasources
Added new case for CRIS compatibility
fixed bean name
added utilities to load the ontology profiles and pass them to the M/R jobs
context for new funder rpf
new consistency nodes
Workflow to test invalid repo profiles
Consistency of sizes (mdstores,objectstores,repo profiles and db)
resolve country codes in the mapping towards hbasee
update for bulktagging properties and node bean definition
indentation
included node filterManaged in transformWithTDS.wf
Removed reference to old orgs table and added ec specific fields of organizations when loading onto HBASE
of course its typology also in the SELECT
datasourceclass column was renamed to typology
Fixed query for Datacite sets
deleted deprecated xslt for corda h2020
Fixed description node
Fixed template for funder database: there was still an arc pointing to the unexisting setFunderAcronym node
trying to fix issues with H2020 and tara project links
Node to set the funder acronym for transformation is not usable together with TDS and transformator service. The funder acronym must be cabled into the transformation rules. For Croatian projects we must create 2 distinct TDS
added WF for tranform scholexplorer links into actionsets
Allow assignment of INGEST wf to aggregator of data repos
fixed wf template for datacite OAI sets
removed old transformation for DOAJ journal list
Mappings for re3data and opendoar are now TDS rules and not anymore xslt in the classpath
Removed old XSLT for doaj article titles: the relative TDS rule must be used instead.
Mapping for journaltitles and irdb are now TDS rules and not anymore xslt in the classpath
Incremental transformation for Datacite OAI sets
Deleted old xslt: now that we have incremental transformation we have to use TDS rules
queryProjectOrganization.sql reads from dsm_organizations
we don't need anymore prepareQueryDatasources.sql
fixed wf template
context for miur funder
take lastupdate in consideration for orgs and projects
Funder wfs now use xslt as TDS rule for transformations
using dsm_datasources in sql expressions
using dsm_datasources
refactoring aka2db
fixed arc target node name
To support incremental update of projects in the database, useful functionality of IncrementalTransformationJobNode has been extracted to a superclass. Wf template for entity registries has been updated accordingly. Wf templates for other types of ds that were already in incremental mode have been updated for the new param name inherited from the superclass.
RCUK: handle cases when the /funder/name is blank
openaireLayoutToRecordStylesheet.xsl moved in dnet-openaireplus-mapping-utils so that it can be shared among openaire workflows and the rest controller for direct index feeding
Incremental transformation also for entity registries
Added wf parameter for dashboard visibility
merged incremental transformation to trunk
3577#note-2: added also for aggregator::softwarerepository (e.g. SoftwareHeritage)
3577#note-2: correct classnames for Software and ORP repositories
Re-introduced <FIELD name="_dnet_resource_identifier_">: it is required if we want to enable the updates on the database.
hardcoded mapping between datasource typology code and label
fixing mappings for datasource subjects, languages, contenttypes
using dsm_datasources instead of datasources table, fixed datasource data migration rules for languages and od_contenttypes
modified context and transformation for Academy of Finland in accord with #3119#note-5
removed trailing ; in sql
assume the view orgs is already there
removed column 'trust' from re3data mapping
fixed params in datasource entity reg aggregation workflow
upgraded version of jdbc driver for postgres, this includes a lot of changes, Because we have to add @transactional annotation in some properties to avoid connection closed exception
merged branch dsm into trunk
Adjusted trust levels to give priorities to organizations from re3data and OpenDOAR rather than EC databases. Among corda and corda h2020, the latter wins.
adding back dear old claim sql queries, we still need them