Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  src 38142 over 9 years Marek Horst #1147 preserving newlines when ingesting plaint...
deploy.info 416 Bytes 34902 over 9 years Marek Horst #1047 renaming icm-iis-ingest-webcrawl SVN loca...
pom.xml 2.98 KB 38953 about 9 years Marek Horst fixing scm location
  • svn:ignore: .* target

Latest revisions

# Date Author Comment
38953 02/09/2015 02:01 PM Marek Horst

fixing scm location

38898 01/09/2015 09:56 AM Lukasz Dumiszewski

target folders to svn:ignore

38142 09/07/2015 01:09 PM Marek Horst

#1147 preserving newlines when ingesting plaintext from htmls. This will eliminate some of the false positives in reference extraction algorithms

36288 09/04/2015 07:10 PM Marek Horst

#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters

35707 27/03/2015 09:41 AM Marek Horst

#1135 switching icm-iis-parent-container version to 1.0.1-SNAPSHOT in order to include workingDir related changes made in icm-iis-core

34912 27/02/2015 06:57 PM Marek Horst

#1147 renaming toplaintext wf name with plaintext to be more appriopriate

34911 27/02/2015 06:56 PM Marek Horst

#1147 renaming toplaintext dir name with plaintext to be more appriopriate

34906 27/02/2015 06:18 PM Marek Horst

#1147 introducing first version of html->plaintext ingester utilizing jsoup library

34902 27/02/2015 05:39 PM Marek Horst

#1047 renaming icm-iis-ingest-webcrawl SVN location to icm-iis-ingest

34897 27/02/2015 05:37 PM Marek Horst

#1047 renaming icm-iis-ingest-webcrawl SVN location to icm-iis-ingest

View revisions

Also available in: Atom