# Date Author Comment
61483 05/08/2021 01:22 PM Andreas Czerniak

add feature to change the pagination start number, enhance debugging

61481 05/08/2021 12:21 PM Andreas Czerniak

add feature to change the pagination start number, enhance debugging

61317 02/07/2021 11:56 AM Andreas Czerniak

avoid exception in RestIterator

61302 25/06/2021 04:00 PM Andreas Czerniak

to avoid null pointer exception

61301 24/06/2021 09:19 PM Andreas Czerniak

to avoid null pointer exception

61259 21/06/2021 04:18 PM Andreas Czerniak

enhance logging for generic approach

61031 21/05/2021 10:04 AM Alessia Bardi

REST plugin accepts optional request header parameters as a JSON map, e.g. {Accept:application/json}. This is useful to support PostgREST endpoints, which need a parameter in the header to return proper JSON. Request headers must also be used for pagination (not implemented in this commit). More details on PostgREST at https://postgrest.org/en/stable/api.html....
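As an illustration of the header map described in this entry, here is a minimal Python sketch (not the plugin's own code). It assumes the parameter is passed as strict JSON with quoted keys and values; the function names `parse_request_headers` and `build_request` and the URL `https://example.org/items` are hypothetical.

```python
import json
import urllib.request


def parse_request_headers(raw):
    """Parse an optional JSON map of request headers into a dict.

    An absent or blank parameter yields no extra headers.
    """
    if raw is None or not raw.strip():
        return {}
    parsed = json.loads(raw)
    if not isinstance(parsed, dict):
        raise ValueError("request headers must be a JSON object")
    return {str(name): str(value) for name, value in parsed.items()}


def build_request(url, raw_headers=None):
    """Create (but do not send) a request carrying the parsed headers."""
    return urllib.request.Request(url, headers=parse_request_headers(raw_headers))


# Hypothetical endpoint; nothing is sent over the network here.
request = build_request("https://example.org/items", '{"Accept": "application/json"}')
print(request.get_header("Accept"))
```

Treating the parameter as optional keeps endpoints that need no special headers working unchanged.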

61017 18/05/2021 03:29 PM Andreas Czerniak

correct replace str in json

61016 18/05/2021 02:48 PM Andreas Czerniak

refine json to xml element-names

61015 18/05/2021 02:44 PM Andreas Czerniak

refine json to xml element-names

61014 18/05/2021 02:43 PM Andreas Czerniak

refine json to xml element-names

61010 17/05/2021 05:01 PM Andreas Czerniak

refining of request url creation and increase version number

61009 17/05/2021 04:37 PM Andreas Czerniak

queryParams could be empty

60566 03/03/2021 04:07 PM Alessia Bardi

PRIVATE : I have the impression we are not preventing a NullPointerException during collection that I am experiencing with the eurostat API of the EU Open Data Portal.

60309 28/01/2021 04:06 PM Alessia Bardi

Added info log with the REST endpoint and parameter of the first call

60307 28/01/2021 12:52 PM Alessia Bardi

Added new optional parameter in case the resultFormatValue is not 'json' but the output is in json format

59849 13/11/2020 05:48 PM Miriam Baglioni

removed some logs and added check to fix issue with fct rest template

59848 13/11/2020 05:44 PM Alessia Bardi

Excluding also doi.org

59845 13/11/2020 04:45 PM Miriam Baglioni

changed way to express if condition

59844 13/11/2020 04:41 PM Miriam Baglioni

other logs

59842 13/11/2020 04:29 PM Miriam Baglioni

added another log.debug to verify the code is doing what is expected

59841 13/11/2020 04:14 PM Miriam Baglioni

added logs in debug to verify the header parameter in the basic authentication

59836 13/11/2020 11:43 AM Miriam Baglioni

added the basic authentication strategy

59308 31/08/2020 10:55 AM Alessia Bardi

Do not even try to resolve if the String does not match the crossref DOI regexp
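The pre-check described in this entry could look roughly like the Python sketch below (not the plugin's own code). It assumes the pattern meant is the one Crossref recommends for modern DOIs; the constant `DOI_PATTERN` and the function `resolvable_dois` are hypothetical names.

```python
import re

# Assumption: the pattern Crossref recommends for modern DOIs, matched case-insensitively.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/[-._;()/:A-Z0-9]+$", re.IGNORECASE)


def resolvable_dois(candidates):
    """Keep only the strings worth sending to a DOI resolver."""
    kept = []
    for candidate in candidates:
        cleaned = candidate.strip()
        if DOI_PATTERN.match(cleaned):
            kept.append(cleaned)
    return kept


print(resolvable_dois(["10.1000/xyz123", "not-a-doi", " 10.5281/zenodo.12345 "]))
```

Filtering before resolution avoids a network round trip for strings that cannot be DOIs.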

59307 31/08/2020 10:14 AM Alessia Bardi

This should fix the problem of stack overflow on the recursive hasNext
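One common way to avoid this kind of overflow is to replace the recursion with a loop. The Python sketch below illustrates the idea only; it is an assumption, not the actual fix in this revision, and the class `PagedIterator` and its in-memory `pages` input are hypothetical stand-ins for remote calls.

```python
from collections import deque


class PagedIterator:
    """Iterates over records delivered page by page; pages may be empty."""

    def __init__(self, pages):
        self._pages = iter(pages)  # hypothetical stand-in for remote page requests
        self._buffer = deque()

    def has_next(self):
        # Loop, rather than recurse, until a record is buffered or pages run out.
        while not self._buffer:
            page = next(self._pages, None)
            if page is None:
                return False
            self._buffer.extend(page)
        return True

    def next(self):
        if not self.has_next():
            raise StopIteration
        return self._buffer.popleft()


# Thousands of consecutive empty pages would exhaust the stack if has_next
# called itself once per empty page; the loop handles them in constant depth.
pages = [[] for _ in range(5000)] + [["record-1", "record-2"]]
iterator = PagedIterator(pages)
collected = []
while iterator.has_next():
    collected.append(iterator.next())
print(collected)
```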

59095 13/07/2020 05:34 PM Alessia Bardi

DOIResolver plugin now supports multiple csv files in the input folder (baseURL) and incremental

59084 13/07/2020 12:02 PM Alessia Bardi

Logging

59080 13/07/2020 11:39 AM Alessia Bardi

Trying to solve discoverability of the new DOIResolver plugin

58995 25/06/2020 07:29 PM Alessia Bardi

Plugin that collects records from DOI resolvers based on a given list of DOIs

58994 25/06/2020 07:26 PM Alessia Bardi

refactored methods working with json so they can be reused by other plugins

58993 25/06/2020 07:25 PM Alessia Bardi

refactored methods working with json so they can be reused by other plugins

58977 22/06/2020 02:55 PM Alessia Bardi

Updated javadoc

58976 22/06/2020 02:55 PM Alessia Bardi

Log in English

58603 05/05/2020 04:37 PM Alessia Bardi

Log more at the debug level instead of the info level

58532 23/04/2020 05:25 PM Andreas Czerniak

RestCollectorPlugin: fix the issue to discover the end of deep-cursor, v1.4.3-SNAPSHOT

58528 23/04/2020 12:41 PM Andreas Czerniak

RestCollectorPlugin: prevent NullPointerException at mdStore, v1.4.2-SNAPSHOT

58521 22/04/2020 08:49 PM Andreas Czerniak

RestCollectorPlugin: fix termination of deep-cursor; v1.4.1-SNAPSHOT

58520 22/04/2020 08:43 PM Andreas Czerniak

RestCollectorPlugin: fix termination of deep-cursor; v1.4.1-SNAPSHOT

58488 14/04/2020 01:30 PM Andreas Czerniak

RestCollectorPlugin: change to iteration on deep-cursor

58457 09/04/2020 09:23 PM Andreas Czerniak

Rest-Collector plugin: changes deep-cursor for CrossRef, v1.3.39-SNAPSHOT, sf

58456 09/04/2020 09:05 PM Andreas Czerniak

Rest-Collector plugin: changes deep-cursor for CrossRef, v1.3.39-SNAPSHOT

58452 09/04/2020 04:21 PM Andreas Czerniak

Rest-Collector plugin: deep-cursor for CrossRef, v1.3.37-SNAPSHOT

58451 09/04/2020 03:26 PM Andreas Czerniak

Rest-Collector plugin: fix issue with RestCollectorPlugin class, no version chg

58449 08/04/2020 10:17 PM Andreas Czerniak

Rest-Collector plugin: additional log.debug statements

58445 08/04/2020 03:52 PM Alessia Bardi

Use XMLCleaner

58428 07/04/2020 09:26 PM Andreas Czerniak

RestCollector plugin - new resumptionType: deep-cursor, exception added

58427 07/04/2020 09:14 PM Andreas Czerniak

RestCollector plugin - new resumptionType: deep-cursor

58383 02/04/2020 07:42 PM Andreas Czerniak

adding loop to clean-up the json format

58316 23/03/2020 02:08 PM Andreas Czerniak

add AuthMethod Bearer to RestIterator class, changed version to 1.3.32-SNAPSHOT

58315 23/03/2020 02:08 PM Andreas Czerniak

add AuthMethod Bearer to RestIterator class, changed version to 1.3.32-SNAPSHOT

57700 27/11/2019 10:47 AM Miriam Baglioni

fixed issue on parsing argument parameter for the plugin

56854 06/08/2019 02:01 PM Alessia Bardi

Test DataciteESIterator's decompression method

56853 06/08/2019 12:26 PM Alessia Bardi

Log exception when decompressing

56852 06/08/2019 12:18 PM Alessia Bardi

Removed baseUrl value: must be passed via the aggregator GUI

56282 01/07/2019 11:08 AM Alessia Bardi

log target URL in debug

55320 12/04/2019 03:17 PM Alessia Bardi

Fixed timestamp calculation

55315 12/04/2019 02:46 PM Alessia Bardi

2nd attempt for date using apache lib

55311 12/04/2019 02:27 PM Alessia Bardi

Fixing date parsing on DataciteCollector Plugin

55262 10/04/2019 02:25 PM Alessia Bardi

Datacite plugin now gets the baseUrl from the interfacedescriptor

55216 08/04/2019 12:11 PM Sandro La Bruzzo

implemented Datacite collector plugin from Elasticsearch dump

53854 17/11/2018 06:20 PM Alessia Bardi

code formatting

53853 17/11/2018 06:18 PM Alessia Bardi

fixed class name for logs

53781 15/11/2018 12:56 PM Andreas Czerniak

Issue Enhancement #3858 and code cleanup in RestIterator

53691 09/11/2018 03:50 PM Claudio Atzori

Kaggle/Reactome: added configurable params

53688 09/11/2018 02:20 PM Giorgos Papanikos

added configurable producer timeout

53685 09/11/2018 01:50 PM Giorgos Papanikos

Added safeguard parsing of produced xml to catch badly escaped illegal characters (eg etf: \001 escaped as ). Changed harvesting thread execution from ThreadExecutor to Thread.Start

53683 09/11/2018 11:45 AM Claudio Atzori

Kaggle/Reactome: factored out file write procedure

53677 08/11/2018 05:29 PM Giorgos Papanikos

added more logging and removed fair option from blocking queue

53664 08/11/2018 11:29 AM Giorgos Papanikos

Updated string identifier type reference to use enum

53663 08/11/2018 11:21 AM Claudio Atzori

added main classes to verify the content collected from Kaggle and Reactome

53660 08/11/2018 10:41 AM Giorgos Papanikos

53654 07/11/2018 06:29 PM Miriam Baglioni

fixed issue

53653 07/11/2018 06:25 PM Miriam Baglioni

53652 07/11/2018 06:15 PM Miriam Baglioni

fixed issue for stopping iteration execution

53641 06/11/2018 06:49 PM Giorgos Papanikos

Added default empty dataset document serialization for endpoints where no dataset can be retrieved

53640 06/11/2018 04:18 PM Claudio Atzori

optional parameters for the schema.org plugin

53630 05/11/2018 06:29 PM Claudio Atzori

added all the possible parameters to the Schema.org ProtocolDescriptor

53616 04/11/2018 03:23 PM Giorgos Papanikos

Corrected Creation Date format

53615 04/11/2018 02:33 PM Giorgos Papanikos

deleted dead code. shouldn't have been committed in the first place

53614 04/11/2018 02:23 PM Giorgos Papanikos

Added schema.org harvesting plugin. Supports sitemapindex files and api listing calls to retrieve endpoints list

53183 19/09/2018 09:15 AM Andreas Czerniak

enhancement of new resumptionType, Issue Enhancement #3858

53163 18/09/2018 09:07 AM Andreas Czerniak

fix JSON replacement with cleanUnwantedJsonCharsInXmlTagnames

53123 14/09/2018 02:12 PM Andreas Czerniak

use XmlCleaner for cleaning up XML results and
prepare for next revision.

53116 13/09/2018 03:55 PM Andreas Czerniak

org.json.XML - update maven package version to 20180813
better unicode support

53071 11/09/2018 05:12 PM Miriam Baglioni

add fix related to #3849 (control characters in xml files break the transformation)

52997 03/09/2018 09:26 AM Andreas Czerniak

org.json.XML - Workaround for JSON element names -> XML tagnames.
remove resumptionParam&-Type from first 'query' URL.

52983 24/08/2018 10:10 AM Andreas Czerniak

Additional to discover option in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52982 23/08/2018 04:08 PM Andreas Czerniak

Additional comments, debugging output in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52979 23/08/2018 09:23 AM Andreas Czerniak

Additional comments, debugging output and small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52971 14/08/2018 11:43 AM Andreas Czerniak

Small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52970 10/08/2018 11:27 AM Andreas Czerniak

Changes in the Rest_Json CollectorPlugin with enhancements for the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/

52783 23/07/2018 11:52 AM Miriam Baglioni

use HttpConnector to download XML instead of VTDGen parse URL method

52644 02/07/2018 03:15 PM Miriam Baglioni

remove "\n" from all the cell contents

52643 02/07/2018 02:42 PM Miriam Baglioni

fix issue in input data

52618 29/06/2018 02:07 PM Miriam Baglioni

minor

52614 29/06/2018 01:57 PM Miriam Baglioni

changes in the implementation of the iterator

52611 29/06/2018 12:42 PM Miriam Baglioni

little adjustment to fix data format in input data

52520 18/06/2018 12:21 PM Claudio Atzori

cleanup

52519 18/06/2018 12:07 PM Miriam Baglioni

minor

52518 18/06/2018 11:23 AM Miriam Baglioni

fix for package name (HTTPWithFileName -> httpfilename) and fixed issue on iterator for HTTPWithFileNameCollectorIterable