add feature to change the pagination start number, enhance debugging
avoid exception in RestIterator
to avoid null pointer exception
enhance logging for generic approach
REST plugin accepts an optional Request Headers parameter as a JSON map, e.g. {Accept:application/json}. This is useful to support PostgREST endpoints: they need a param in the header to return proper JSON. Request headers must also be used for pagination (not implemented in this commit). More details on PostgREST at https://postgrest.org/en/stable/api.html....
correct replace str in json
refine json to xml element-names
refining of request url creation and increase version number
queryParams could be empty
PRIVATE : I have the impression we are not preventing a NullPointerException during collection that I am experiencing with the eurostat API of the EU Open Data Portal.
Added info log with the REST endpoint and parameter of the first call
Added new optional parameter in case the resultFormatValue is not 'json' but the output is in json format
removed some logs and added check to fix issue with fct rest template
Excluding also doi.org
changed way to express if condition
other logs
added another log.debug to verify the code is doing what is expected
added logs in debug to verify the header parameter in the basic authentication
added the basic authentication strategy
Do not even try to resolve if the String does not match the crossref DOI regexp
This should fix the problem of stackoverflow on the recursive hasNext
DOIResolver plugin now supports multiple csv files in the input folder (baseURL) and incremental
Logging
Trying to solve discoverability of the new DOIResolver plugin
Plugin that collects records from DOI resolvers based on a given list of DOIs
refactored methods working with json so they can be reused by other plugins
Updated javadoc
Log in English
Log more at the debug level instead of the info level
RestCollectorPlugin: fix the issue to discover the end of deep-cursor, v1.4.3-SNAPSHOT
RestCollectorPlugin: prevent NullPointerException at mdStore, v1.4.2-SNAPSHOT
RestCollectorPlugin: fix termination of deep-cursor; v1.4.1-SNAPSHOT
RestCollectorPlugin: change to iteration on deep-cursor
Rest-Collector plugin: changes deep-cursor for CrossRef, v1.3.39-SNAPSHOT, sf
Rest-Collector plugin: changes deep-cursor for CrossRef, v1.3.39-SNAPSHOT
Rest-Collector plugin: deep-cursor for CrossRef, v1.3.37-SNAPSHOT
Rest-Collector plugin: fix issue with RestCollectorPlugin class, no version chg
Rest-Collector plugin: additional log.debug statements
Use XMLCleaner
RestCollector plugin - new resumptionType: deep-cursor, exception added
RestCollector plugin - new resumptionType: deep-cursor
adding loop to clean-up the json format
add AuthMethod Bearer to RestIterator class, changed version to 1.3.32-SNAPSHOT
fixed issue on parsing argument parameter for the plugin
Test DataciteESIterator's decompression method
Log exception when decompressing
Removed baseUrl value: must be passed via the aggregator GUI
log target URL in debug
Fixed timestamp calculation
2nd attempt for date using apache lib
Fixing date parsing on DataciteCollector Plugin
Datacite plugin now gets the baseUrl from the interfacedescriptor
implemented Datacite collector plugin from Elasticsearch dump
code formatting
fixed class name for logs
Issue Enhancement #3858 and code cleanup in RestIterator
Kaggle/Reactome: added configurable params
added configurable producer timeout
Added safeguard parsing of produced xml to catch badly escaped illegal characters (eg etf: \001 escaped as ). Changed harvesting thread execution from ThreadExecutor to Thread.Start
Kaggle/Reactome: factored out file write procedure
added more logging and removed fair option from blocking queue
Updated string identifier type reference to use enum
added main classes to verify the content collected from Kaggle and Reactome
fixed issue
fixed issue for stopping iteration execution
Added default empty dataset document serialization for endpoints where no dataset can be retrieved
optional parameters for the schema.org plugin
added all the possible parameters to the Schema.org ProtocolDescriptor
Corrected Creation Date format
deleted dead code. shouldn't have been committed in the first place
Added schema.org harvesting plugin. Supports sitemapindex files and api listing calls to retrieve endpoints list
enhancement of new resumptionType, Issue Enhancement #3858
fix JSON replacement with cleanUnwantedJsonCharsInXmlTagnames
use XmlCleaner for cleaning up XML results and prepare for next revision.
org.json.XML - update maven package version to 20180813; better unicode support
add fix related to #3849 (control characters in xml files break the transformation)
org.json.XML - Workaround for JSON element names -> XML tagnames. Remove resumptionParam&-Type from first 'query' URL.
Additional 'discover' option in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Additional comments, debugging output in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Additional comments, debugging output and small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Changes in the Rest_Json CollectorPlugin with enhancements for the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
use HttpConnector to download XML instead of VTDGen parse URL method
remove "\n" from all the cell contents
fix issue in input data
minor
changes in the implementation of the iterator
little adjustment to fix data format in input data
cleanup
fix for package name (HTTPWithFileName -> httpfilename) and fixed issue on iterator for HTTPWithFileNameCollectorIterable