# Date Author Comment
59849 13/11/2020 05:48 PM Miriam Baglioni

removed some logs and added check to fix issue with fct rest template

59848 13/11/2020 05:44 PM Alessia Bardi

Excluding also doi.org

59845 13/11/2020 04:45 PM Miriam Baglioni

changed way to express if condition

59844 13/11/2020 04:41 PM Miriam Baglioni

other logs

59842 13/11/2020 04:29 PM Miriam Baglioni

added another log.debug to verify the code is doing what is expected

59841 13/11/2020 04:14 PM Miriam Baglioni

added logs in debug to verify the header parameter in the basic authentication

59836 13/11/2020 11:43 AM Miriam Baglioni

added the basic authentication strategy

59308 31/08/2020 10:55 AM Alessia Bardi

Do not even try to resolve if the String does not match the crossref DOI regexp

59307 31/08/2020 10:14 AM Alessia Bardi

This should fix the problem of stackoverflow on the recursive hasNext

59095 13/07/2020 05:34 PM Alessia Bardi

DOIResolver plugin now supports multiple csv files in the input folder (baseURL) and incremental

59084 13/07/2020 12:02 PM Alessia Bardi

Logging

59080 13/07/2020 11:39 AM Alessia Bardi

Trying to solve discoverability of the new DOIResolver plugin

58995 25/06/2020 07:29 PM Alessia Bardi

Plugin that collects records from DOI resolvers based on a given list of DOIs

58994 25/06/2020 07:26 PM Alessia Bardi

refactored methods working with json so they can be reused by other plugins

58993 25/06/2020 07:25 PM Alessia Bardi

refactored methods working with json so they can be reused by other plugins

58977 22/06/2020 02:55 PM Alessia Bardi

Updated javadoc

58976 22/06/2020 02:55 PM Alessia Bardi

Log in English

58603 05/05/2020 04:37 PM Alessia Bardi

Log more at the debug level instead of the info level

58532 23/04/2020 05:25 PM Andreas Czerniak

RestCollectorPlugin: fix the issue to discover the end of deep-cursor, v1.4.3-SNAPSHOT

58528 23/04/2020 12:41 PM Andreas Czerniak

RestCollectorPlugin: prevent NullPointerException at mdStore, v1.4.2-SNAPSHOT

58521 22/04/2020 08:49 PM Andreas Czerniak

RestCollectorPlugin: fix termination of deep-cursor; v1.4.1-SNAPSHOT

58520 22/04/2020 08:43 PM Andreas Czerniak

RestCollectorPlugin: fix termination of deep-cursor; v1.4.1-SNAPSHOT

58488 14/04/2020 01:30 PM Andreas Czerniak

RestCollectorPlugin: change to iteration on deep-cursor

58457 09/04/2020 09:23 PM Andreas Czerniak

Rest-Collector plugin: changes deep-cursor for CrossRef, v1.3.39-SNAPSHOT, sf

58456 09/04/2020 09:05 PM Andreas Czerniak

Rest-Collector plugin: changes deep-cursor for CrossRef, v1.3.39-SNAPSHOT

58452 09/04/2020 04:21 PM Andreas Czerniak

Rest-Collector plugin: deep-cursor for CrossRef, v1.3.37-SNAPSHOT

58451 09/04/2020 03:26 PM Andreas Czerniak

Rest-Collector plugin: fix issue with RestCollectorPlugin class, no version chg

58449 08/04/2020 10:17 PM Andreas Czerniak

Rest-Collector plugin: additional log.debug statements

58445 08/04/2020 03:52 PM Alessia Bardi

Use XMLCleaner

58428 07/04/2020 09:26 PM Andreas Czerniak

RestCollector plugin - new resumptionType: deep-cursor, exception added

58427 07/04/2020 09:14 PM Andreas Czerniak

RestCollector plugin - new resumptionType: deep-cursor

58383 02/04/2020 07:42 PM Andreas Czerniak

adding loop to clean-up the json format

58315 23/03/2020 02:08 PM Andreas Czerniak

add AuthMethod Bearer to RestIterator class, changed version to 1.3.32-SNAPSHOT

57700 27/11/2019 10:47 AM Miriam Baglioni

fixed issue on parsing argument parameter for the plugin

56854 06/08/2019 02:01 PM Alessia Bardi

Test DataciteESIterator's decompression method

56853 06/08/2019 12:26 PM Alessia Bardi

Log exception when decompressing

56852 06/08/2019 12:18 PM Alessia Bardi

Removed baseUrl value: must be passed via the aggregator GUI

56282 01/07/2019 11:08 AM Alessia Bardi

log target URL in debug

55320 12/04/2019 03:17 PM Alessia Bardi

Fixed timestamp calculation

55315 12/04/2019 02:46 PM Alessia Bardi

2nd attempt for date using apache lib

55311 12/04/2019 02:27 PM Alessia Bardi

Fixing date parsing on DataciteCollector Plugin

55262 10/04/2019 02:25 PM Alessia Bardi

Datacite plugin now gets the baseUrl from the interfacedescriptor

55216 08/04/2019 12:11 PM Sandro La Bruzzo

implemented Datacite collector plugin from Elasticsearch dump

53854 17/11/2018 06:20 PM Alessia Bardi

code formatting

53853 17/11/2018 06:18 PM Alessia Bardi

fixed class name for logs

53781 15/11/2018 12:56 PM Andreas Czerniak

Issue Enhancement #3858 and code cleanup in RestIterator

53691 09/11/2018 03:50 PM Claudio Atzori

Kaggle/Reactome: added configurable params

53688 09/11/2018 02:20 PM Giorgos Papanikos

added configurable producer timeout

53685 09/11/2018 01:50 PM Giorgos Papanikos

Added safeguard parsing of produced xml to catch badly escaped illegal characters (eg etf: \001 escaped as ). Changed harvesting thread execution from ThreadExecutor to Thread.Start

53683 09/11/2018 11:45 AM Claudio Atzori

Kaggle/Reactome: factored out file write procedure

53677 08/11/2018 05:29 PM Giorgos Papanikos

added more logging and removed fair option from blocking queue

53664 08/11/2018 11:29 AM Giorgos Papanikos

Updated string identifier type reference to use enum

53663 08/11/2018 11:21 AM Claudio Atzori

added main classes to verify the content collected from Kaggle and Reactome

53660 08/11/2018 10:41 AM Giorgos Papanikos

53654 07/11/2018 06:29 PM Miriam Baglioni

fixed issue

53653 07/11/2018 06:25 PM Miriam Baglioni

53652 07/11/2018 06:15 PM Miriam Baglioni

fixed issue for stopping iteration execution

53641 06/11/2018 06:49 PM Giorgos Papanikos

Added default empty dataset document serialization for endpoints where no dataset can be retrieved

53616 04/11/2018 03:23 PM Giorgos Papanikos

Corrected Creation Date format

53615 04/11/2018 02:33 PM Giorgos Papanikos

deleted dead code. shouldn't have been committed in the first place

53614 04/11/2018 02:23 PM Giorgos Papanikos

Added schema.org harvesting plugin. Supports sitemapindex files and api listing calls to retrieve endpoints list

53183 19/09/2018 09:15 AM Andreas Czerniak

enhancement of new resumptionType, Issue Enhancement #3858

53163 18/09/2018 09:07 AM Andreas Czerniak

fix JSON replacement with cleanUnwantedJsonCharsInXmlTagnames

53123 14/09/2018 02:12 PM Andreas Czerniak

use XmlCleaner for cleaning up XML results and
prepare for next revision.

53116 13/09/2018 03:55 PM Andreas Czerniak

org.json.XML - update maven package version to 20180813
better unicode support

53071 11/09/2018 05:12 PM Miriam Baglioni

add fix related to #3849 (control characters in xml files break the transformation)

52997 03/09/2018 09:26 AM Andreas Czerniak

org.json.XML - Workaround for JSON element names -> XML tagnames.
remove resumptionParam&-Type from first 'query' URL.

52983 24/08/2018 10:10 AM Andreas Czerniak

Additional to discover option in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52982 23/08/2018 04:08 PM Andreas Czerniak

Additional comments, debugging output in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52979 23/08/2018 09:23 AM Andreas Czerniak

Additional comments, debugging output and small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52971 14/08/2018 11:43 AM Andreas Czerniak

Small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52970 10/08/2018 11:27 AM Andreas Czerniak

Changes in the Rest_Json CollectorPlugin with enhancements for the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52783 23/07/2018 11:52 AM Miriam Baglioni

use HttpConnector to download XML instead of VTDGen parse URL method

52644 02/07/2018 03:15 PM Miriam Baglioni

remove "\n" from all the cell contents

52643 02/07/2018 02:42 PM Miriam Baglioni

fix issue in input data

52618 29/06/2018 02:07 PM Miriam Baglioni

minor

52614 29/06/2018 01:57 PM Miriam Baglioni

changes in the implementation of the iterator

52611 29/06/2018 12:42 PM Miriam Baglioni

little adjustment to fix data format in input data

52520 18/06/2018 12:21 PM Claudio Atzori

cleanup

52519 18/06/2018 12:07 PM Miriam Baglioni

minor

52518 18/06/2018 11:23 AM Miriam Baglioni

fix for package name (HTTPWithFileName -> httpfilename) and fixed issue on iterator for HTTPWithFileNameCollectorIterable

52514 15/06/2018 06:39 PM Claudio Atzori

small refactor

52510 15/06/2018 05:09 PM Claudio Atzori

small refactor

52496 15/06/2018 02:14 PM Claudio Atzori

used blocking methods in HTTPWithFileNameCollectorIterable

52240 25/05/2018 05:28 PM Miriam Baglioni

added information to the associated URL for junk metadata

52238 25/05/2018 05:08 PM Miriam Baglioni

fixed issue when .jos metadata extension contains xml content

52237 25/05/2018 03:21 PM Miriam Baglioni

code cleaning

52235 25/05/2018 02:51 PM Miriam Baglioni

changed implementation of data gathering

52230 25/05/2018 12:54 PM Miriam Baglioni

some logs for debugging reasons added

52102 18/05/2018 04:43 PM Miriam Baglioni

minor

52100 18/05/2018 04:10 PM Miriam Baglioni

stupid mistake

52099 18/05/2018 04:08 PM Miriam Baglioni

minor

52093 18/05/2018 11:58 AM Miriam Baglioni

check for malformed json

52059 16/05/2018 04:51 PM Miriam Baglioni

remove DOCTYPE from metadata xml

52056 16/05/2018 03:34 PM Miriam Baglioni

minor

52054 16/05/2018 03:01 PM Miriam Baglioni

filtering metadata and added param in template to specify what to filter out

52036 15/05/2018 03:31 PM Miriam Baglioni

removed DOCTYPE from xml metadata document

52034 15/05/2018 02:52 PM Miriam Baglioni

minor

52031 15/05/2018 02:10 PM Miriam Baglioni

modified update of information in xml metadata

52026 14/05/2018 05:57 PM Miriam Baglioni

considered the case where metadata are given in xml format instead of json