# Date Author Comment
29044 11/07/2014 07:31 PM Eri Katsari

m

29043 11/07/2014 06:54 PM Claudio Atzori

added serialization, tests

29042 11/07/2014 06:53 PM Claudio Atzori

instantiate one SAXReader for each call

29041 11/07/2014 06:52 PM Claudio Atzori

fixed format-layout-interpretation concatenation,
doesn't fail when the fieldExtractor returns a null result

29040 11/07/2014 06:51 PM Claudio Atzori

added json serialization, builds the matching key one time only

29039 11/07/2014 04:52 PM Andrea Mannocci

updated profile to new OAI schema

29038 11/07/2014 04:41 PM Claudio Atzori

using format-layout-interpretation to define the OAI store collection name

29037 11/07/2014 04:40 PM Alessia Bardi

The expected name of collection is format-layout-interpretation

29036 11/07/2014 04:30 PM Alessia Bardi

do not upsert sets here in the mapper: we shall delegate to a separate workflow to be run after the OAI feeding is completed.

29034 11/07/2014 03:49 PM Claudio Atzori

early implementation of jar upload script

29032 11/07/2014 03:49 PM Alessia Bardi

Always new records to test how much faster we go

29030 11/07/2014 03:36 PM Alessia Bardi

Always new records to test how much faster we go

29027 11/07/2014 02:23 PM Michele Artini

fixed a problem with lower case

29023 11/07/2014 01:20 PM Marek Horst

renaming root element from list to citations

29020 11/07/2014 12:46 PM Michele Artini

retry on exceptions

29019 11/07/2014 12:21 PM Michele Artini

javadoc annotations

29018 11/07/2014 12:19 PM Michele Artini

Refactoring

29017 11/07/2014 10:29 AM Marek Horst

#486 fixing integration test: introducing missing document_text_wos input port for primary/processing

29016 11/07/2014 10:26 AM Marek Horst

#486 introducing the last missing piece: a text collapser in front of referenceextraction_researchinitiatives that joins text contents coming from the already existing document_text input port and the newly introduced document_text_wos input port providing WoS contents

29015 10/07/2014 11:12 PM Jochen Schirrwagen

refactor of xsl template and split function instead of recursive calls

29014 10/07/2014 07:41 PM Alessia Bardi

Refactored class that extracts fields from records. When we can't find an expected index from the configuration to check its repeatability, the field is indexed as repeatable and a counter is updated.

29013 10/07/2014 07:31 PM Alessia Bardi

pass the configuration string indented.

29012 10/07/2014 07:26 PM Jochen Schirrwagen

redefining the behaviour of the "skipRecord" rule by adding attribute "syntaxcheck" in the record header, #323

29011 10/07/2014 06:57 PM Claudio Atzori

fixed application context

29010 10/07/2014 06:34 PM Sandro La Bruzzo

Updated SolrIndexDocument: it now returns an instance of SolrInputDocument

29009 10/07/2014 06:33 PM Claudio Atzori

idScheme and idNamespace defined as part of the OAI configuration profile

29008 10/07/2014 06:33 PM Sandro La Bruzzo

changed signature of the method in indexCollection;
the method lookup now throws an IndexServiceException

29007 10/07/2014 06:31 PM Sandro La Bruzzo

Implemented SolrIndexCollection

29006 10/07/2014 06:11 PM Claudio Atzori

added IDSCHEME and IDNAMESPACE elements to the OAI configuration profile

29005 10/07/2014 06:04 PM Marek Horst

#486 bugfix: reordering existence filter with id replacer: we need to update identifiers first, then update existence filter

29004 10/07/2014 05:58 PM Alessia Bardi

No hadoop-parent: classes needed in mapreduce-jobs have been copied so that the jar does not inherit "heavy dependencies" on spring and dnet IS.

29003 10/07/2014 05:56 PM Claudio Atzori

proto to pace mapping parses the whole entity

29002 10/07/2014 05:56 PM Claudio Atzori

proto to pace mapping parses the whole entity

29001 10/07/2014 05:55 PM Alessia Bardi

Need to pass idscheme and namespace parameter to the job.

29000 10/07/2014 05:54 PM Alessia Bardi

Removed dependency on dnet-oai-utils to avoid inheriting unwanted jars such as cnr-rmi-api, cnr-service-common, spring, etc., which should not appear when running a job on the cluster. Needed classes have been copied and adapted so they do not use spring anymore.

28999 10/07/2014 05:47 PM Claudio Atzori

branch to adapt the proto to pace mapping

28998 10/07/2014 05:43 PM Claudio Atzori

branch to adapt the proto to pace mapping

28997 10/07/2014 05:12 PM Alessia Bardi

Need dnet-hadoop-parent because it is imported by dnet-mapreduce-jobs. Added <relativePath/> to avoid build warning.

28996 10/07/2014 05:10 PM Sandro La Bruzzo

Added bean

28995 10/07/2014 05:09 PM Sandro La Bruzzo

removed Browse and weights from IndexServerDao Interface

28994 10/07/2014 05:06 PM Claudio Atzori

submittable M/R OAI feeding job

28993 10/07/2014 04:36 PM Claudio Atzori

simplified index switch wf

28992 10/07/2014 04:30 PM Claudio Atzori

manual start

28991 10/07/2014 04:23 PM Marek Horst

replacing redundant transformers/ingest/pmc/citations with already existing transformers/importer/documentmetadata/idextractor

28990 10/07/2014 04:15 PM Marek Horst

updating job.properties

28988 10/07/2014 03:49 PM Katerina Iatropoulou

Parameter check added and better string handling

28987 10/07/2014 03:37 PM Marek Horst

integrating pmc citations ingestion with primary workflow, adjusting port names, deduplicating dependencies

28986 10/07/2014 02:37 PM Claudio Atzori

derp changes

28984 10/07/2014 12:31 PM Michele Artini

Box to select Valid/pending ds

28983 10/07/2014 10:49 AM Michele Artini

Changed validate/invalidate labels

28982 10/07/2014 10:49 AM Andrea Mannocci

DS profiles updated

index format correction

28981 10/07/2014 09:49 AM Claudio Atzori

extended dedup configuration, including now blacklists and algorithm parameters

28980 10/07/2014 09:47 AM Claudio Atzori

added wf to perform only index switch (BB msg to the search service)

28979 10/07/2014 09:46 AM Claudio Atzori

renamed meta wf definition file

28978 10/07/2014 09:46 AM Claudio Atzori

small changes

28977 10/07/2014 09:45 AM Claudio Atzori

wf fails when indexId is not found in the env

28976 09/07/2014 08:50 PM mateusz.fedoryszak

duplicates handling

28975 09/07/2014 08:41 PM mateusz.fedoryszak

roll-back

28974 09/07/2014 07:06 PM mateusz.fedoryszak

META-INF dir will now be omitted when priming

28973 09/07/2014 05:55 PM mateusz.fedoryszak

dir names in parameters should not contain nameNode

28972 09/07/2014 04:11 PM Andrea Mannocci

added carousel browsing boxes and modified initial form

28971 09/07/2014 04:10 PM Andrea Mannocci
28970 09/07/2014 04:10 PM Andrea Mannocci

added two browsing fields

28969 09/07/2014 04:05 PM Andrea Mannocci

added accordion-toggle class which draws bootstrap chevrons on the right edge of an accordion according to its status

28968 09/07/2014 03:03 PM Claudio Atzori

added typologyclass to each api

28967 09/07/2014 01:12 PM Marek Horst

replacing redundant transformers/ingest/pmc/citations with already existing transformers/importer/documentmetadata/idextractor

28966 09/07/2014 01:02 PM Marek Horst

replacing redundant transformers/ingest/pmc/citations with already existing transformers/importer/documentmetadata/idextractor

28965 09/07/2014 12:53 PM Andrea Mannocci

- removed disturbing dependencies from pom
- implemented actual lookups on the IS for lightUI profiles

28963 09/07/2014 12:36 AM Antonis Lempesis

changed target namespace

28962 08/07/2014 07:27 PM Eri Katsari
28961 08/07/2014 07:25 PM Claudio Atzori

early implementation

28960 08/07/2014 06:22 PM Claudio Atzori

added UnescapeHtml util

28959 08/07/2014 06:15 PM Michele Artini

oai harvester using Jochen's HttpConnector and XmlCleaner

28958 08/07/2014 05:58 PM Claudio Atzori

added record indentation

28957 08/07/2014 05:19 PM Marek Horst

updating default job.properties

28956 08/07/2014 05:18 PM Marek Horst

updating programmatic execution of chmod on meta.json file, still not working due to "Permission denied" warnings

28955 08/07/2014 05:15 PM Marek Horst

updating default job.properties

28954 08/07/2014 05:14 PM Marek Horst

updating default job.properties

28953 08/07/2014 05:14 PM Marek Horst

updating default job.properties

28952 08/07/2014 04:57 PM Marek Horst

renaming input ports from input_citation to input_citations to be aligned with exporter subworkflow

28951 08/07/2014 04:55 PM Marek Horst

skipping exporting citation matching outcome

28950 08/07/2014 04:44 PM Marek Horst

renaming input ports from input_citation to input_citations to be aligned with exporter subworkflow

28949 08/07/2014 04:38 PM Marek Horst

#691 introducing citations exporter module, updating exporter workflow.xml definition

28948 08/07/2014 04:33 PM Andrea Mannocci

added a schema validating the lightui profiles

28947 08/07/2014 04:06 PM Michele Artini

selection on System parameters

28946 08/07/2014 03:40 PM Michele Artini

visualization of long params

28945 08/07/2014 03:34 PM Claudio Atzori

added bean wfNodeFindSearchService

28942 08/07/2014 03:05 PM Marek Horst

introducing confidenceToTrustLevelNormalizationFactor getter method

28940 08/07/2014 01:41 PM Marek Horst

fixing importing abstract after introducing fieldApprover for all Result fields

28939 08/07/2014 01:04 PM Marek Horst

#568 renaming Citations#sourceDocumentId field to Citations#documentId

28938 08/07/2014 12:55 PM Marek Horst

#568 introducing CitationEntry and Citations schemas

28937 08/07/2014 11:55 AM Marek Horst

introducing fieldApprover for all Result fields

28936 08/07/2014 10:22 AM Claudio Atzori

fixed rel direction

28935 07/07/2014 06:42 PM Eri Katsari
28932 07/07/2014 05:55 PM Eri Katsari
28931 07/07/2014 05:52 PM mateusz.fedoryszak

rename a field

28930 07/07/2014 05:48 PM Eri Katsari

testing direct sqoop import in oozie

28928 07/07/2014 05:44 PM Claudio Atzori

use of TryIndentXmlString

28925 07/07/2014 05:36 PM Claudio Atzori

added TryIndentXmlString. Doesn't break in case of broken XML.

28924 07/07/2014 05:08 PM Claudio Atzori

early implementation