#1498 introducing major citations related refactoring including new generic direct citation matching moved to processing phase, introduced position field in all citations schemas and updated collapser taking position into account when merging citations details coming from 3 variuos sources: fuzzy citationmatching, direct citationmatching, references metadata
merging trunk changes with IIS-CDH-5.3.0 branch
#1212 updating taxomonies database, introducing acm taxonomy classification, introducing acm classes support in exporter module, updating integration tests
#1315 propagating confidenceLevel to DocumentToConceptIds. Updating PIG transformer script by introducing concept identifiers deduplication UDF function picking record with the highest confidence level, introducing unit and integration tests. Propagating changes in document to concepts exporter module.
#1329 adding affiliations field in ExtractedDocumentMetadata PMC schema. Metadata extraction code refactoring by extracting code responsible for building Affiliation avro records to AffiliationBuilder class and sharing it with pmc ingestion. Implementing affiliations ingestion functionality in PmcXmlHandler covered with unit tests. Adding affiliations field support in ingest pmc metadata transformer.
#1306 introducing dummy field in DocumentId schema required to overcome https://issues.apache.org/jira/browse/PIG-3358 issue. Handling dummy filed in transformer pig scripts when it is required. Should be reverted as soon as PIG-3358 issue is fixed
removing avro-maven-plugin versioning conflicting with ${iis.avro.version}
fixing parent version to 1.0.1-CDH-5.3.0-SNAPSHOT
fixing version to 1.0.1-CDH-5.3.0-SNAPSHOT
#1247 renaming inputEntityId to inputObjectId because not all objects are entities (e.g. metadataextraction input)
#1247 renaming id field to more descriptive inputEntityId
#1247 introducing third draft of Fault avro schema: adding missing stracktrace
#1247 introducing second draft of Fault avro schema: refactoring recursive causes to array of causes
#1247 introducing first draft of Fault avro schema
dependencies cleanup: removing protocol buffer dependency from schemas, only avro should be supported
creating IIS-CDH-5.3.0 branch
introducing branches folder
#118 introducing madis based communities generation for website usage analysis
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] copy for tag icm-iis-schemas-1.0.0
[maven-release-plugin] prepare release icm-iis-schemas-1.0.0
#1044 pre-release switching to released version of parent pom and released dependencies
introducing scm definition
#919 renaming DocumentToResearchInitiative to DocumentToConceptId and DocumentToResearchInitiatives to DocumentToConceptIds
#1022 removing ExtractedDocumentMetadata envelope: origin info is not required
#1022 introducing ExtractedDocumentMetadata envelope required for collapsing PMC metadata records
removing unused import statement
fixing indent
#919 introducing Concept schema and importer module producing avro datastore based on XML profile
#686 introducing ExtractedDocumentMetadataEnvelope schema definition
#577 introducing citation envelope
#1017 introducing PMC extracted metadata schema
introducing detailed confidenceLevel field description placed in external eu/dnetlib/iis/README.markdown file
#963 introducing DocumentToMDStore datastore definition holding mappings between dataset identifier and mdstore indetifier holding given dataset
#720 confidence level description
#118 introducing LogEntry related comment in avdl file
#118 introducing log entry schema
#913 renaming DocumentContentUrl#contentSize to DocumentContentUrl#contentSizeKB changing field type from int to long, importing content size from ObjectStoreFile#fileSizeKB, updating dnet-objectstore-rmi dependency from 1.0.0 to 2.0.1-SNAPSHOT
#913 changing DocumentContentUrl#contentSize field type from string to int
#913 introducing DocumentContentUrl#contentSize field, handling it properly in all PIG transformers
#840 moving IdentifierMapping from importer to common package
#840 renaming DeduplicationMapping to more generic IdentifierMapping
introducing PmidMapping schema
updating countryCode comment to: country ISO 3166-1 alpha-2 uppercased code
Adding info on where to find types of citations produced by PMC citation ingestmodule
fixing comment
introducing address field
adding new countryCode field to affiliation
created tag folder for release
#577 updating Citation namespace
#577 introducing common.citations.Citation schema
removing redundant ReferenceBasicMetadata and ReferenceMetadata definitions which are also available in standalone avdl definitions, replacing definitions with import statements.
removing deprecated PersonWithInferencedData avro schema
removing deprecated DocumentWithInferencedData and DataSetReferenceWithInferencedData avro schemas
#568 renaming Citations#sourceDocumentId field to Citations#documentId
#568 introducing CitationEntry and Citations schemas
rename a field