# Date Author Comment
29043 11/07/2014 06:54 PM Claudio Atzori

added serialization, tests

29042 11/07/2014 06:53 PM Claudio Atzori

instantiate one SAXReader for each call

29041 11/07/2014 06:52 PM Claudio Atzori

fixed format-layout-interpretation concatenation,
doesn't fail when the fieldExtractor returns a null result

29040 11/07/2014 06:51 PM Claudio Atzori

added json serialization, builds the matching key only once

29036 11/07/2014 04:30 PM Alessia Bardi

do not upsert sets here in the mapper: we shall delegate to a separate workflow to be run after the OAI feeding is completed.

29034 11/07/2014 03:49 PM Claudio Atzori

early implementation of jar upload script

29032 11/07/2014 03:49 PM Alessia Bardi

Always treat records as new, to test how much faster we go

29030 11/07/2014 03:36 PM Alessia Bardi

Always treat records as new, to test how much faster we go

29014 10/07/2014 07:41 PM Alessia Bardi

Refactored the class that extracts fields from records. When an expected index cannot be found in the configuration to check its repeatability, the field is indexed as repeatable and a counter is updated.

29009 10/07/2014 06:33 PM Claudio Atzori

idScheme and idNamespace defined as part of the OAI configuration profile

29002 10/07/2014 05:56 PM Claudio Atzori

proto to pace mapping parses the whole entity

29000 10/07/2014 05:54 PM Alessia Bardi

Removed the dependency on dnet-oai-utils to avoid inheriting unwanted jars such as cnr-rmi-api, cnr-service-common, spring, etc., which should not appear when running a job on the cluster. The needed classes have been copied and adapted so that they no longer use spring.

28998 10/07/2014 05:43 PM Claudio Atzori

branch to adapt the proto to pace mapping

28981 10/07/2014 09:49 AM Claudio Atzori

extended dedup configuration, now including blacklists and algorithm parameters

28904 04/07/2014 03:33 PM Alessia Bardi

Format, layout and interpretation are obtained from the collection name rather than being fixed.

28467 26/06/2014 04:38 PM Claudio Atzori

namespace cleanup

28457 25/06/2014 07:20 PM Claudio Atzori

removed unused field <dri:repositoryId/>

28411 24/06/2014 09:59 AM Claudio Atzori

removed protocolbuffers dependency from dnet-pace-core, Builders and Proto-specific tests moved to dnet-openaireplus-mapping-utils, adapted dnet-mapreduce-jobs

28311 19/06/2014 03:15 PM Claudio Atzori

oaf schema location passed as parameter by the workflow

28309 19/06/2014 02:24 PM Alessia Bardi

Testing without depending on a running mdstore

28308 19/06/2014 01:49 PM Claudio Atzori

small refactor

28303 19/06/2014 12:54 PM Alessia Bardi

OAI feed map-only job

28226 16/06/2014 09:30 AM Claudio Atzori

fixed oaf to xml serialization

28094 09/06/2014 04:19 PM Claudio Atzori

merged from branch 0.0.4

28083 09/06/2014 02:57 PM Claudio Atzori

tests

28082 09/06/2014 02:57 PM Claudio Atzori

inferred stuff will be expanded right after the main entity element

28081 09/06/2014 02:56 PM Claudio Atzori

cleanup

28068 07/06/2014 11:09 AM Claudio Atzori

updated test configuration

28067 07/06/2014 11:08 AM Claudio Atzori

helper method to discover the type of the entity target of a relationship, used during the xml expansion

28066 07/06/2014 11:06 AM Claudio Atzori

avoid emitting the relationships stored in rows containing a deleted metadata body

28060 06/06/2014 05:53 PM Claudio Atzori

fixed relationship distribution

28058 06/06/2014 05:12 PM Claudio Atzori

almost working workflows on hbase

27941 03/06/2014 06:48 PM Claudio Atzori

dedup working

27768 24/05/2014 04:13 PM Claudio Atzori

fixed version number

27767 24/05/2014 04:12 PM Claudio Atzori

branch for 4.0.0

27199 06/05/2014 05:56 PM Claudio Atzori

added early implementation of OAI feeding job (M/R)

27148 05/05/2014 09:40 AM Claudio Atzori

fixed IIS output escaping

27147 05/05/2014 09:39 AM Claudio Atzori

added support for one way relationships

26857 17/04/2014 12:49 PM Sandro La Bruzzo

SCRIPT_COMMENT: fixed deploy.info file for the module dnet-mapreduce-jobs

26716 16/04/2014 03:51 PM Sandro La Bruzzo

SCRIPT_COMMENT: Added deploy.info file to the module dnet-mapreduce-jobs