hadoop profiles for the orcid propagation
Synchronised with the version currently running on beta after fixing the namespace prefix for RSF, innoviris and RIF
fixed XPath to get value for fundedamount (miss . in path)
fixed issue #4908
Fixes #5090: opendoar repo without repository names
Added currency for the amount funded to each organization. Currency set to EUR by default since we have projects funded by the EU
deduped orgs to OpenOrgs DB (jobs + wfs) using temporary hdfs files
new transformation rule to import open orgs similarities
Hadoop Job for propagation of community results through organization
renamed file
introduced distcp configuration profile to synchronise data from DM to IIS cluster
#3535: context profile for EPOS
#4996: funding for RISIS projects
#4996: fixed profile format for RISIS
#4996: updated information for RISIS
#4982: changed id and label for the EuroMarine context profile and updated file name
#4990: context profile for beopen - transport research
new contexts for RI mes and RISIS
fixed projectid in ifremer infrastructure profile
#4995: profile for IFREMER
updated transformation to match #4969 and added if statement to handle cases where the contribution for the organization is not a valid number (i.e. the value is simply '.'). In cases like this one generally the contribution for the organization in 0. In the transformation rule when the value to be transformed is not a number (that is a Natural number) the contribution element is not produced
Updated transformations to match #4969
Transformation rules modified to consider also funded amount and summary for the projects. Not all the rules consider both the updates (not all the funders provide both funded amount and summary for the project) ref #4965
Disable dataset mining module in pre-processing
#4852: updated vocabulary of provenance with labels for the portals in english name
transformation rule fro Russian Science Foundation
Change the namespaceprefix of the funder and the funder name and shortcut
Addressing #2563: OA-PG context re-registered on beta
hadoop jobs for propagation of project result through semantic relation and for result to community thorugh semantic relation
Updated label of Aginfra community
added profile for transform and collect in the new dnet-hadoop system
new HadoopJob profile for dumping proto with merged updates
added filterIndexRecordsJob configuration profile
added STATUS element children to match the profile schema
#4671: versioning also for OpenDOAR repositories, when available
#4671: versioning index field for datasources
formatting
Converted text to CDATA
Updated list of ELIXIR-GR tool
Copied current version from beta
added workflow configuration and hadoop job for orcid publications without doi import
added field organizationdupid, distint values for organizationlegalname, organizationlegalshortname, organizationalternativenames
reformat code
#4570: one Landing page only
Fix #4570: proper Landing pages to EGI App DB
Fix #4572: dateAceepted from dateType Issued (as according to the guidelines for software and orp)
#Fix #4568: New rule for EGI App DB with correct handling of rights.Rule created from xslt_cleaning_datarepo_datacite.xml
Added missing rule
mapping dateAccepted, accessrights, license, language, CobjCategory from eny depth and using local-name (namespace unaware)
updated for new funding stream information
#4187: id is now biotoolsID
Rule for datacite and the new cap copied from beta
fixes #2563: too many results for OA postgrant pilot
Fixes #4235#note-4: multiple creator names in literal in claims from datacite
FIXES #4427#note-4: not all orgs are finnish
Only one landing page
New ds typologies
added hadoop Job of transform and collection for new Implemetation of Dnet Hadoop
Fixes #4235
Transfromation for NeuroVault
handle case no funding is given for SNSF projects
Tagging for elixir-gr based on the list of tools. Everything related to the concept has been grouped in the same section of the xslt.
emails to check are in credit not contact
leave out orcid ids
removed datatcite namespace on authors
Let's consider authors also primary contacts, otherwise we exclude too many records.
Let's also check for the elixirNode value
Adding elixir-gr concepts only if the author belongs to the given list that I cannot commit
TDS for biotools
Deleted unused rule
Fixed oaf template for access rights, collected from and hostedby
Map all non-inferred subjects and pass their classid and classname.Info about access right is in bestaccesright but for retrocompatibility we should still support the old bestlicense.Hostedby defaults to OpenAIRE instead of the UnknownRepository.
moved transformation for oaf and dr elements out of result element
TDS for DOE Code
right, not rights
use bestaccessrights, not bestlicense
GB is the good country code, instead of UK
Removed old opendoar mapping to db
Instead of removing legalshortname, we consider a shortname the additional name that is all in caps. XSLT upgraded to xslt 2.0 to exploit the built-in function
Setting ec_nonprofit attribute from re3data when we can
Do not map the first institutionAdditionalName into legalshortname, because the definition of the field is "The alternative name or acronym for the responsible institution": we are not sure it is an acronym.
implementation of the procedure to export native softwares on hdfsaddition of needed workflows and classes
Mapping DOAJ subjects into subjects of dsm_datasources
introducing S3 storage related parameters
Setting claim=false to projects/zenodo communities/content providers and subcommunities in case there are not
changed transformation rule to match the new data
Implemented ORCID event generation process and relative configuration profileAdded workflow to orchestrate the event generation for software links
compress output in prepareIndexDataJob
commented optional2 because of lenght constraints in the field optional2 in the db
moved jsonextrainfo to optional2 due to mining issues
moved info in jsonextrainfo to optional2
the format of start and end date was wrong. Changed from dd-MM-yyyy to dd/MM/yyyy
added fields relresulttype and relclass
added the new field creationdate as param to the community profiles
fixed compliance with the schema
check for removing records with missing titles