Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
brokerAdditionJob.xml 3.11 KB almost 8 years claudio.atzori
brokerEnrichmentJob.xml 4.15 KB 52826 almost 6 years Claudio Atzori adjusted broker events jobs
brokerEnrichmentProjectsJob.xml 4.86 KB 53260 over 5 years Claudio Atzori introduced mapping resulttype -> portal url
brokerEnrichmentSoftwareLinksJob.xml 4.71 KB 54764 over 5 years Claudio Atzori Implemented ORCID event generation process and ...
brokerJoinProjectPublicationJob.xml 3.18 KB 47980 almost 7 years Claudio Atzori added job conf for the broker project events
buildMergedToAnchorMapJob.xml 3.38 KB over 8 years claudio.atzori
bulkTaggingJob.xml 2.9 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
calculatePersonDistributionStep1Job.xml 3.34 KB over 8 years michele.artini
calculatePersonDistributionStep2Job.xml 3.16 KB over 8 years michele.artini
coauthorUpdateJob.xml 2.99 KB almost 9 years claudio.atzori
connectedComponentsJob.xml 2.69 KB 53090 over 5 years Claudio Atzori use 100 reducers for the connectedComponentsJob
copyTableJob.xml 1.88 KB over 9 years claudio.atzori
countInfospaceJob.xml 3.28 KB 53707 over 5 years Claudio Atzori added job profiles for infoSpace counts
dedupAnchorStatsJob.xml 2.75 KB about 8 years claudio.atzori
dedupBuildRootsJob.xml 3.1 KB about 9 years claudio.atzori
dedupCandidateScanJob.xml 3.49 KB about 9 years claudio.atzori
dedupDeleteDedupRelsJob.xml 3.44 KB about 8 years claudio.atzori
dedupDeleteSimRelsJob.xml 3.08 KB almost 9 years claudio.atzori
dedupExportPersonFullnameJob.xml 3.22 KB about 8 years claudio.atzori
dedupFindPersonRootsJob.xml 3.41 KB almost 9 years claudio.atzori
dedupFindRootsJob.xml 3.02 KB about 10 years claudio.atzori
dedupFixRelationsJob.xml 4.25 KB 54155 over 5 years Claudio Atzori added new job
dedupGTCleanerJob.xml 3.03 KB about 8 years claudio.atzori
dedupGrouperJob.xml 2.98 KB about 10 years claudio.atzori
dedupIndexFeedJob.xml 3.79 KB about 9 years claudio.atzori
dedupMarkDeletedEntityJob.xml 2.7 KB 49328 over 6 years Claudio Atzori getting rid of person entities
dedupMergeCoAuthors.xml 3.19 KB almost 9 years claudio.atzori
dedupMinDistGraphJob.xml 2.72 KB over 8 years claudio.atzori
dedupPersonJob.xml 3.18 KB almost 9 years claudio.atzori
dedupRootsExportJob.xml 2.78 KB almost 9 years claudio.atzori
dedupRootsPersonExportJob.xml 2.79 KB about 8 years claudio.atzori
dedupRootsToCSVJob.xml 4.75 KB about 8 years claudio.atzori
dedupSimilarity2ActionsJob.xml 3.34 KB 49374 over 6 years Claudio Atzori getting rid of person entities
dedupSimilarity2GraphJob.xml 2.91 KB 49328 over 6 years Claudio Atzori getting rid of person entities
dedupSimilarity2HdfsActionsJob.xml 3.27 KB 49328 over 6 years Claudio Atzori getting rid of person entities
dhp_migration_all_steps.xml 2.22 KB 58269 about 4 years Michele Artini profiles for migration of entities from mongo/p...
dhp_migration_claims.xml 2.23 KB 58269 about 4 years Michele Artini profiles for migration of entities from mongo/p...
dhp_migration_step1.xml 2.02 KB 58269 about 4 years Michele Artini profiles for migration of entities from mongo/p...
dhp_migration_step2.xml 1.97 KB 58269 about 4 years Michele Artini profiles for migration of entities from mongo/p...
dhp_migration_step3.xml 1.7 KB 58269 about 4 years Michele Artini profiles for migration of entities from mongo/p...
distcpJob.xml 1.6 KB 57414 over 4 years Claudio Atzori introduced distcp configuration profile to sync...
dnetHadoopCollectionJob.xml 2.58 KB 56724 almost 5 years Sandro La Bruzzo added profile for transform and collect in the ...
dnetHadoopTransformationJob.xml 2.19 KB 57415 over 4 years Claudio Atzori renamed file
elasticsearchTestJob.xml 2.57 KB about 8 years claudio.atzori
exportIdentifiersJob.xml 2.52 KB over 8 years claudio.atzori
exportOpenOrgsOrganizations.xml 2.85 KB 57510 over 4 years Michele Artini deduped orgs to OpenOrgs DB (jobs + wfs) using ...
exportOpenOrgsSimilarities.xml 2.85 KB 57510 over 4 years Michele Artini deduped orgs to OpenOrgs DB (jobs + wfs) using ...
exportSummaryRecordsJob.xml 2.7 KB 49918 over 6 years Claudio Atzori optional SCAN
filterIndexRecordsJob.xml 2.99 KB 56709 almost 5 years Claudio Atzori added filterIndexRecordsJob configuration profile
iisCacheBuilderJob.xml 1.65 KB 54792 over 5 years Marek Horst introducing S3 storage related parameters
iisMainJob.xml 5.99 KB over 7 years claudio.atzori
iisMainJobV2.xml 5.64 KB 54792 over 5 years Marek Horst introducing S3 storage related parameters
iisPreprocessingJob.xml 3.86 KB almost 8 years claudio.atzori
iisPreprocessingJobV2.xml 3.61 KB 57273 over 4 years Alessia Bardi Disable dataset mining module in pre-processing
iisPreprocessingQuickJob.xml 3.68 KB over 9 years marek.horst
importDOIBoostJob.xml 3.32 KB 54156 over 5 years Claudio Atzori 32 reducers seems to work better than 16
importGridAcJob.xml 2.62 KB 53164 over 5 years Claudio Atzori fixed action set name
importOrcidJob.xml 2.62 KB 56585 almost 5 years Claudio Atzori added STATUS element children to match the prof...
importScholexplorerJob.xml 2.77 KB 52936 almost 6 years Claudio Atzori fixed importScholexplorerJob configuration
indexFeedJob.xml 3.71 KB 49064 over 6 years Claudio Atzori added index.solr.compress.result parameter
informationSpaceConfigurableExportJob.xml 2.82 KB 54157 over 5 years Claudio Atzori minor changes
informationSpaceExportJob.xml 3.01 KB over 8 years claudio.atzori
informationSpaceExportSubjectsJob.xml 3.25 KB 53428 over 5 years Claudio Atzori added job to export subject terms from results
informationSpaceImportJob.xml 2.51 KB over 8 years claudio.atzori
informationSpaceMergedUpdatesExportJob.xml 3.2 KB 56719 almost 5 years Miriam Baglioni new HadoopJob profile for dumping proto with me...
informationSpaceMergedUpdatesExportMultipleOutputJob.xml 6.23 KB 58292 about 4 years Claudio Atzori infospace export procedure produces a set of ne...
informationSpaceSoftwareExportJob.xml 2.78 KB 54922 over 5 years Michele De Bonis implementation of the procedure to export nativ...
invalidRecordDoiExporterJob.xml 2.58 KB 54157 over 5 years Claudio Atzori minor changes
lodExportJob.xml 15.2 KB 53897 over 5 years Claudio Atzori re-adding lodExportJob profile
mdStoreHdfsImportAuthorsJob.xml 2.36 KB almost 9 years claudio.atzori
mdStoreHdfsImportJob.xml 2.35 KB about 10 years claudio.atzori
oaiFeedJob.xml 3.46 KB 52110 about 6 years Alessia Bardi Added new parameter to the OAI M/R jobs for the...
offlineHbaseLoadJob.xml 2.89 KB about 9 years claudio.atzori
oozieGenericJob.xml 2.49 KB 58328 about 4 years Claudio Atzori formatting
personCsvJoinJob.xml 2.53 KB almost 8 years claudio.atzori
predatoryJournalsJob.xml 2.87 KB 52963 almost 6 years Claudio Atzori added jobs for predatory journal analysis
prepareBrokerDataJob.xml 3.33 KB almost 8 years claudio.atzori
prepareIndexDataJob.xml 3.73 KB 54763 over 5 years Claudio Atzori compress output in prepareIndexDataJob
promoteActions.xml 2.31 KB about 9 years claudio.atzori
promoteMultipleActionSets.xml 2.43 KB about 8 years claudio.atzori
promoteSingleActionSet.xml 2.38 KB about 8 years claudio.atzori
propagationCountryInstitutionalOrganization.xml 4.59 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationCountryInstitutionalOrganizationSaveToFile.xml 4.6 KB 57824 over 4 years Miriam Baglioni Implementation of InstOrgKeys moved in a genera...
propagationORCIDToResultJob.xml 3.4 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationOrcidToResultSaveToFileJob.xml 3.47 KB 57825 over 4 years Miriam Baglioni added property propagatetoorcid.semanticrelatio...
propagationOrganizationToResultThroughDatasource.xml 4.62 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationOrganizationToResultThroughDatasourceSaveToFile.xml 4.57 KB 57826 over 4 years Miriam Baglioni Hadoop job configuration for propagation of pro...
propagationOrganizationToResultThroughSemRel.xml 4.77 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationOrganizationToResultThroughSemRelSaveToFile.xml 4.72 KB 57826 over 4 years Miriam Baglioni Hadoop job configuration for propagation of pro...
propagationProjectToResultJob.xml 3.5 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationProjectToResultSaveToFileJob.xml 3.4 KB 57823 over 4 years Miriam Baglioni remved a space in the list of allowed semantic ...
propagationResultToCommunityThorughOrganizationSaveToFileJob.xml 3.92 KB 58049 over 4 years Miriam Baglioni update parameter name to match the name in the ...
propagationResultToCommunityThroughOrganizationJob.xml 3.77 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationResultToCommunityThroughSemanticRelationJob.xml 3.67 KB 58324 about 4 years Claudio Atzori added hbase.client.keyvalue.maxsize=0 to avoid ...
propagationResultToCommunityThroughSemanticRelationSaveToFileJob.xml 3.86 KB 57822 over 4 years Miriam Baglioni added property propagatetocommunity.semanticrel...
publicationAnalysisJob.xml 2.6 KB about 8 years claudio.atzori
resetDedupJob.xml 2.41 KB about 10 years claudio.atzori
sqoopStatsUpdateJob.xml 4.46 KB over 9 years claudio.atzori
xmlRecordCounterJob.xml 2.45 KB 53707 over 5 years Claudio Atzori added job profiles for infoSpace counts
xmlRecordValidatorJob.xml 2.66 KB 54157 over 5 years Claudio Atzori minor changes

Latest revisions

# Date Author Comment
58328 25/03/2020 06:57 PM Claudio Atzori

formatting

58324 23/03/2020 05:13 PM Claudio Atzori

added hbase.client.keyvalue.maxsize=0 to avoid error KeyValue size too large from hbase client

58293 18/03/2020 09:18 AM Claudio Atzori

hadoop job for the submission of an arbitrary oozie workflow on the OCEAN hadoop cluster

58292 18/03/2020 09:17 AM Claudio Atzori

infospace export procedure produces a set of newline-delimited json text files, organized in folders, one per entity type, plus one dedicated to relationships

58269 16/03/2020 12:11 PM Michele Artini

profiles for migration of entities from mongo/postgres to hadoop (DHP)

58049 03/02/2020 10:47 AM Miriam Baglioni

update parameter name to match the name in the corresponding mapper class

58048 03/02/2020 10:26 AM Miriam Baglioni

updated hadoop profile for new implementation of composite keys

57826 05/12/2019 05:29 PM Miriam Baglioni

Hadoop job configuration for propagation of product to organization both for direct ownership of the product by the datasource that provides the organization, and for semantic relationship with a product that belongs to the organization

57825 05/12/2019 05:27 PM Miriam Baglioni

added property propagatetoorcid.semanticrelation to list the set of semantic relations allowed for propagation

57824 05/12/2019 05:25 PM Miriam Baglioni

Implementation of InstOrgKeys moved in a general folder. Changed in the profiule the path to found the key and the comparator

View revisions

Also available in: Atom