Project

General

Profile

Statistics
| Revision:

# Date Author Comment
40061 20/11/2015 05:49 PM Alessia Bardi

namespace declaration

40016 20/11/2015 03:19 PM Alessia Bardi

configuration for enriched sets

40014 20/11/2015 03:11 PM Claudio Atzori

mapping includes DOIs for datasets and preserve multiple original IDs

39623 19/10/2015 11:35 AM Michele Artini

use of external properties

39567 14/10/2015 11:58 AM Michele Artini

use of Text instead of ImmutableBytesWritable

39562 13/10/2015 04:56 PM Michele Artini

reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)

39556 13/10/2015 12:55 PM Claudio Atzori

added default threshold parameters. #1209

39478 05/10/2015 12:59 PM Claudio Atzori

using about instead of dataInfo

39455 01/10/2015 06:26 PM Jochen Schirrwagen

fixed dateOfCollection;support of H2020 grantAgreement

39356 23/09/2015 01:12 PM Michele Artini

Removed Greece duplicates

39319 21/09/2015 06:13 PM Claudio Atzori

added one more dedup configuration for organizations

39298 18/09/2015 04:25 PM Claudio Atzori

informationSpaceImportJob

39296 18/09/2015 03:35 PM Claudio Atzori

updated compression parameters

39268 17/09/2015 04:39 PM Claudio Atzori

compressing output

39263 17/09/2015 09:30 AM Michele Artini

add a new term

39262 17/09/2015 08:52 AM Michele Artini

new vocabulary for NSF Contract Types

39259 17/09/2015 08:18 AM Michele Artini

nsf classification

39258 17/09/2015 08:03 AM Michele Artini

new countries and synonims from NSF projects

39253 16/09/2015 06:03 PM Jochen Schirrwagen

update of transformation rule script wrt. identifiers, fp7, h2020

39222 14/09/2015 06:06 PM Claudio Atzori

added information space export job

38998 04/09/2015 10:32 AM Alessia Bardi

added new protocol for re3data

38931 01/09/2015 12:42 PM Alessia Bardi

#1453 Publication Catalogue new vocabulary term

38843 28/08/2015 01:33 PM Jochen Schirrwagen

updated rule script of the claiming datasource with the http://dx.doi.org prefix

38718 25/08/2015 10:03 AM Alessia Bardi

added date of creation for FET context

38660 04/08/2015 03:52 PM Alessia Bardi

Updated mappings for funders and funding

38636 03/08/2015 04:10 PM Alessia Bardi

deleted oldest ec:h2020toas vocabulary

38402 22/07/2015 06:02 PM Alessia Bardi

new vocabulary for external references types.

38369 20/07/2015 02:51 PM Claudio Atzori

xslt mapping for person objects

38368 20/07/2015 02:49 PM Claudio Atzori

added hadoop jobs (dedup person)

38367 20/07/2015 02:48 PM Claudio Atzori

updated person dedup configuration

38158 10/07/2015 12:53 PM Michele Artini

added new indexed fields:
- projectoamandatepublications
- projectecarticle29_3
- projectsubject

38061 06/07/2015 03:46 PM Michele Artini

corda h2020 from ftp

38059 06/07/2015 12:06 PM Claudio Atzori

MapDocument implements a more general view of the pace model

38043 02/07/2015 10:31 AM Alessia Bardi

added trust level threshold for document similarity and document classes

38042 01/07/2015 07:32 PM Alessia Bardi

new parameter for pdb inference module

38016 29/06/2015 03:45 PM Claudio Atzori

configurable entity unpack xsl: the person id depends on the datasource typology (see 'mergeIdForHomonymsMap' param)

38012 29/06/2015 03:20 PM Claudio Atzori

configurable entity unpack xsl: the person id depends on the datasource typology (see 'mergeIdForHomonymsMap' param)

38011 29/06/2015 03:19 PM Claudio Atzori

write the publication in the person row, allowing to collect its coauthors with a m/r job

37967 25/06/2015 04:41 PM Michele Artini

update of "Horizon 2020 - Types of Action" vocabulary

37943 23/06/2015 05:49 PM Alessia Bardi

Added communityname and communityid index fields: we need to be able to exclude funders from the context browse

37909 22/06/2015 10:42 AM Claudio Atzori

added field relfundinglevel0_name

37889 19/06/2015 05:21 PM Jochen Schirrwagen

initial nlm2oaf transformation rule script

37852 18/06/2015 05:13 PM Claudio Atzori

added coauthor workflow and hadoop job

37826 16/06/2015 05:55 PM Claudio Atzori

each person row contains the list of publications, each publication embeds its authors

37779 15/06/2015 11:41 AM Michele Artini

profiles to run calculate Person Distribution

37776 15/06/2015 10:25 AM Claudio Atzori

updated job props

37751 12/06/2015 04:05 PM Claudio Atzori

added workflow to export the representative publications as json on hdfs

37649 05/06/2015 06:51 PM Claudio Atzori

updated primary iis job profile and workflow to the latest specs

37532 28/05/2015 02:46 PM Alessia Bardi

fixed index field name for relfunderjurisdiction

37530 28/05/2015 02:32 PM Alessia Bardi

index fields for funders on the relationships to projects

37473 26/05/2015 11:12 AM Claudio Atzori

search publications by author

37314 19/05/2015 10:45 AM Claudio Atzori

added attribute enabled to dedup configuration orchestrations

37271 15/05/2015 07:22 PM Claudio Atzori

added some regexes to avoid deduplicating big groups of publications

37228 13/05/2015 04:22 PM Claudio Atzori

added mapping profile for datasets

37192 13/05/2015 11:47 AM Alessia Bardi

Indx fields for project funders. #1241

37021 07/05/2015 11:37 AM Claudio Atzori

making the schema happy

37017 07/05/2015 11:14 AM Claudio Atzori

added dedup configuration and orchestration for person entities

37016 07/05/2015 11:13 AM Claudio Atzori

added oaf2hbase mapping profiles

37015 07/05/2015 11:13 AM Claudio Atzori

fixed rootbuilder entries

36744 27/04/2015 12:36 PM Alessia Bardi

Added ftp2 protocol

36740 27/04/2015 11:36 AM Claudio Atzori

added mandatory description

36736 27/04/2015 11:27 AM Claudio Atzori

removing id, not permitted by the schema

36726 27/04/2015 11:14 AM Claudio Atzori

merged branch dedupConf

35778 30/03/2015 12:13 PM Marek Horst

#953 blacklisting da458477233b5561ae47042aa2a73086 content

35756 28/03/2015 11:33 AM Marek Horst

#953 adding bea4728578070c3d66774bf9454d41fe checksum to blacklisted

35557 23/03/2015 04:43 PM Alessia Bardi

Fixed duplicate info:eu-repo/semantics/ prefix for dc:type

35555 23/03/2015 04:40 PM Alessia Bardi

resourcetype is a dataset-specific field and should not be considered when transforming publications from oaf to oai_dc

35316 12/03/2015 05:34 PM Alessia Bardi

doaj needs cleaning rule for languages.

35137 06/03/2015 09:14 AM Michele Artini

corda h2020 projects

35054 04/03/2015 05:14 PM Alessia Bardi

#1041

34904 27/02/2015 05:47 PM Alessia Bardi

some more tricks to better our Opnaire compliance

34881 27/02/2015 04:25 PM Claudio Atzori

attempt to define custom user names #1153

34865 27/02/2015 12:44 PM Alessia Bardi

Using xslt 2.0 and transforming instancetype/@classname values into camel case to try to comply to the guidelines.

34850 26/02/2015 06:16 PM Alessia Bardi

Updated datacite mdformat with the info provided by Datacite web site. Also updated names.

34848 26/02/2015 05:47 PM Andrea Mannocci

solving ticket #1158 Generate the provenance block at collection time

update profile with datacite format

34801 25/02/2015 06:21 PM Alessia Bardi

towards openaire3 compliance for OAI-PMH exports of publications and datasets

34740 23/02/2015 05:07 PM Jochen Schirrwagen

transformation script for dlib magazine

34705 23/02/2015 10:00 AM Claudio Atzori

adapting profile to schema

34646 20/02/2015 10:16 AM Michele Artini

added httpList protocol

34554 18/02/2015 11:03 AM Claudio Atzori

updated dedup configuration profile

34553 18/02/2015 10:55 AM Claudio Atzori

including merge relationship in duplicate scan phase

34440 11/02/2015 04:09 PM Claudio Atzori

extended organization join configuration

34388 10/02/2015 11:21 AM Alessia Bardi

wf and hadoop job updates to support the exclusion of persons and duplicate records during the OAI feed.

33989 19/01/2015 06:36 PM Alessia Bardi

Updated identifiers for categories and concepts to fet-fp7::* to fix #1069.

33852 13/01/2015 11:53 AM Katerina Iatropoulou

Claim EGI projects disabled. FET concepts labels changes so that the category name is not repeated.

33679 18/12/2014 06:13 PM Claudio Atzori

indentation

33368 12/12/2014 03:37 PM Claudio Atzori

added fct:funding_relations profile

33240 09/12/2014 03:18 PM Claudio Atzori

added scheduler pool name

33224 08/12/2014 01:56 PM Jochen Schirrwagen

custom rule script for HAL due to changes in HAL records since 10/2014

33038 27/11/2014 03:41 PM Sandro La Bruzzo

added two protocols to the vocabulary

32989 26/11/2014 12:52 PM Alessia Bardi

FET context

32862 18/11/2014 03:48 PM Claudio Atzori

updated profiles

32861 18/11/2014 03:48 PM Claudio Atzori

defaulting RESOURCE_URI to localhost

32860 18/11/2014 02:35 PM Claudio Atzori

added basic action set profiles

32841 17/11/2014 06:35 PM Claudio Atzori

stats conf moved in the resp. HadoopJobConfiguration profile.

32733 13/11/2014 04:10 PM Alessia Bardi

Updated pid vocabulary for orcid and FCT.

32285 06/11/2014 08:22 PM Marek Horst

updating metadataextraction_excluded_checksums to 1e5b574109da731f4918c7f91fc24864 value

32280 06/11/2014 05:31 PM Sandro La Bruzzo

added the rule that import relateddatacite

32182 04/11/2014 07:53 PM Alessia Bardi

OAI config prfofile in sync with the one on services.openaire.

32178 04/11/2014 07:44 PM Claudio Atzori

updated job profile