implementation of the procedure to export native software on hdfs; addition of needed workflows and classes
Change Mapper to implement DOIBoostToAction
Created DOIBoostToActions Mapping
subject terms exporter
why parse strings as Floats?
reverted to r52985. Test runs show we need to rely on the edgeIds produced by the connected components identification phase instead of the vertexIds
changes to consider modification in code to align to trunk version
alignment to trunk version
avoid producing duplicate events by eliminating the roots from the comparison process
broker event serialization test for ProjectEventFactory
introduced mapping resulttype -> portal url
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.5-BETA
Fixed log class name
avoid collisions when hashing pids by value
cleaned up unused method, using setDurability in put operation
updated opentrial input record used in test, fixes #3886
added mapper and hadoop job configuration file for importing Grid.AC organization data
using mapping-utils version 6.2.11
integrating bulktag from trunk to beta branch
rule out invalid dates also on CrossRefToActions
rule out invalid dates on ScholixToActions
cleanup
produce 'supplement' subrel type in case of supplement relationships
simplified connected component application on the graph
updated dependencies: dnet-pace-core
adding check to investigate the bug that generates wrong relations
do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer
do not push vertex ids in memory, process them on the fly
fixed name of TDS profile used by test method
added jobs for predatory journal analysis
removed warning
added invisible setup
refactored Action
fixed null element
Created CrossrefImportMapper
add CrossRefToAction
bumped dependency version
fixed mapping from scholix to openaire model
small fixes
changed key type
implemented mapper writing
added configuration
added Mapper to transform scholexplorer links into actionsets
deprecation: use setDurability instead of setWriteToWAL
introduced subType in pace wf configuration
adjusted ids export procedure
avoid emitting enrichment events when the similarity score is below the threshold
javadoc and test
indentation
pick the 1st instance to avoid collisions
improved behaviour of EventWrapperTest
Partial implementation of a unit test
Fixed the generation of eventIds
Workaround for CLARIN mining issue: #3670#note-29
depending on dnet-openaireplus-mapping-utils:6.2.8
depending on dnet-openaireplus-mapping-utils:6.2.7
depending on dnet-openaireplus-mapping-utils:6.2.5
depending on specific dnet-openaire-mapping-utils:6.2.5-SNAPSHOT to avoid solr7 branches to get in the classpath
Back to BETA versions
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.3-PROD
oh my pom
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.4-PROD
fixed scm
changing version
now we should be fine with tests for release
excluding mapping-utils 6.2.4-SNAPSHOT
depending on dnet-openaireplus-mapping-utils < 6.2.4 for production
Test for claimed relationship
expand author identifiers
generate ENRICH/MISSING/PID only when the publication didn
added CobjCategory/@type
discover the invalid character from the exception details
mapper class that parses xml records
more tests for Software and Orp entities
expand field distributionlocation in result's instances
Testing datasets with contexts. Works when using dnet-openaireplus-mapping-utils with revision >= r52278 (6.2.4-SNAPSHOT, hence updated pom to use snapshot parent)
Testing EGI as ri and using fam as example of community
Test for collected external references
[maven-release-plugin] prepare release dnet-mapreduce-jobs-1.1.3-BETA
depending on released parent and mapping-utils 6.2.0
Including Open Source among the licenses
Depending on mapping-utils with solrj 4.9 and snapshot parent
Testing Open Source access rights for software
Added counters for missing date of collection and transformation
deploy.info updated with branch info
Do not add to the BasicDBObject properties that are not listed as fields to index
splitAsList cannot be found when running on the cluster (dependency issues with guava?). Let's try to work around it.
OAI M/R jobs expect a new parameter that lists the date patterns to try 'services.publisher.oai.datepatterns'