added index field: reldatasourcecompatibilityid
expanding new relationships, added openairecompatibility in organization expansion
#1583: introducing openaire2.0_data compliance
introduced mapping for dateOfTransformation
using new metadata cache location
introduced new hadoop job profiles (dedup)
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.10
native english name renamed to proprietary
#1265 intoduced mapping for ORCID
mappings moved from classpath as transformation profiles
added new relations: supplement, part, contributor
mapping name
fixed mapping for data provision workflow: oaf2hbase. added mapping that writes the publication in the person row, allowing to collect its coauthors with a m/r job
namespace declaration
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.9
configuration for enriched sets
mapping includes DOIs for datasets and preserve multiple original IDs
use of external properties
use of Text instead of ImmutableBytesWritable
reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)
added default threshold parameters. #1209
using about instead of dataInfo
fixed dateOfCollection;support of H2020 grantAgreement
Removed Greece duplicates
added one more dedup configuration for organizations
informationSpaceImportJob
updated compression parameters
compressing output
add a new term
new vocabulary for NSF Contract Types
nsf classification
new countries and synonims from NSF projects
update of transformation rule script wrt. identifiers, fp7, h2020
added information space export job
added new protocol for re3data
#1453 Publication Catalogue new vocabulary term
updated rule script of the claiming datasource with the http://dx.doi.org prefix
added date of creation for FET context
Updated mappings for funders and funding
deleted oldest ec:h2020toas vocabulary
new vocabulary for external references types.
xslt mapping for person objects
added hadoop jobs (dedup person)
updated person dedup configuration
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.8
added new indexed fields:- projectoamandatepublications- projectecarticle29_3- projectsubject
corda h2020 from ftp
MapDocument implements a more general view of the pace model
added trust level threshold for document similarity and document classes
new parameter for pdb inference module
configurable entity unpack xsl: the person id depends on the datasource typology (see 'mergeIdForHomonymsMap' param)
write the publication in the person row, allowing to collect its coauthors with a m/r job
update of "Horizon 2020 - Types of Action" vocabulary
Added communityname and communityid index fields: we need to be able to exclude funders from the context browse
added field relfundinglevel0_name
initial nlm2oaf transformation rule script
added coauthor workflow and hadoop job
each person row contains the list of publications, each publication embeds its authors
profiles to run calculate Person Distribution
updated job props
added workflow to export the representative publications as json on hdfs
updated primary iis job profile and workflow to the latest specs
fixed index field name for relfunderjurisdiction
index fields for funders on the relationships to projects
search publications by author
added attribute enabled to dedup configuration orchestrations
added some regexes to avoid deduplicating big groups of publications
added mapping profile for datasets
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.7
Indx fields for project funders. #1241
making the schema happy
added dedup configuration and orchestration for person entities
added oaf2hbase mapping profiles
fixed rootbuilder entries
Added ftp2 protocol
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.6
added mandatory description
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.5
removing id, not permitted by the schema
[maven-release-plugin] prepare release dnet-openaireplus-profiles-1.0.4
reverted pom
merged branch dedupConf
#953 blacklisting da458477233b5561ae47042aa2a73086 content
#953 adding bea4728578070c3d66774bf9454d41fe checksum to blacklisted
Fixed duplicate info:eu-repo/semantics/ prefix for dc:type
resourcetype is a dataset-specific field and should not be considered when transforming publications from oaf to oai_dc
doaj needs cleaning rule for languages.
corda h2020 projects
#1041