mailinglist-archive.com
Mailing list home
dev.nutch.apache.org
2012-02
dev.nutch.apache.org -
2012-02 - listed by date
switch to thread view
show all months
2012-03
2012-02
2012-01
2011-12
2011-11
2011-10
2011-09
2011-08
2011-07
2011-06
2011-05
2011-04
2011-03
2011-02
2011-01
2010-12
2010-11
dev.nutch.apache.org (date)
Page(s): 1 |
2
|
3
of 3
February 29, 2012
[jira] [Commented] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
[jira] [Created] (NUTCH-1292) Better exception logging and debugging during fetch.
[jira] [Commented] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
Re: NUTCH-1273
[jira] [Resolved] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
[jira] [Commented] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
NUTCH-1273
[jira] [Updated] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
[jira] [Updated] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
[jira] [Created] (NUTCH-1291) Fetcher to stringify exception on // unexpected exception
[jira] [Commented] (NUTCH-945) Indexing to multiple SOLR Servers
Fwd: [blog post] Accumulo, Nutch, and Gora
[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers
[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers
[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers
[jira] [Commented] (NUTCH-945) Indexing to multiple SOLR Servers
February 28, 2012
Re: [nutchgora] AbstractFetchSchedule.forceFetch method resets fetch status
Re: [nutchgora] AbstractFetchSchedule.forceFetch method resets fetch status
[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[jira] [Updated] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[nutchgora] AbstractFetchSchedule.forceFetch method resets fetch status
[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[jira] [Commented] (NUTCH-670) feed plugin does not parse RSS2 enclosures
[jira] [Created] (NUTCH-1290) crawlId not supported by all Tools
February 27, 2012
[jira] [Commented] (NUTCH-670) feed plugin does not parse RSS2 enclosures
[jira] [Commented] (NUTCH-1289) In distributed mode URL's are not partitioned
[jira] [Commented] (NUTCH-1289) In distributed mode URL's are not partitioned
[jira] [Commented] (NUTCH-1289) In distributed mode URL's are not partitioned
[jira] [Commented] (NUTCH-1289) In distributed mode URL's are not partitioned
[jira] [Updated] (NUTCH-1289) In distributed mode URL's are not partitioned
[jira] [Created] (NUTCH-1289) In distributed mode URL's are not partitioned
[jira] [Updated] (NUTCH-1289) In distributed mode URL's are not partitioned
Jenkins build is back to normal : nutch-trunk-maven #173
February 26, 2012
[jira] [Commented] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)
[jira] [Issue Comment Edited] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[jira] [Updated] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[jira] [Updated] (NUTCH-1273) Fix [deprecation] javac warnings
[jira] [Updated] (NUTCH-1273) Fix [deprecation] javac warnings
Re: Proposal to remove o.a.n.crawl.MapWritable from Nutch codebase.
Proposal to remove o.a.n.crawl.MapWritable from Nutch codebase.
[jira] [Commented] (NUTCH-1253) Incompatible neko and xerces versions
[jira] [Commented] (NUTCH-728) Improve nutch release packaging
[jira] [Commented] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)
Build failed in Jenkins: nutch-trunk-maven #172
February 25, 2012
[jira] [Commented] (NUTCH-670) feed plugin does not parse RSS2 enclosures
[Nutch Wiki] Trivial Update of "WhichTechnicalConceptsAreBehindTheNutchPluginSystem" by LewisJohnMcgibbney
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
February 24, 2012
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
Jenkins build is back to normal : Nutch-trunk #1767
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-1210) DomainBlacklistFilter
February 23, 2012
[jira] [Commented] (NUTCH-1210) DomainBlacklistFilter
Re: svn commit: r1292764 - in /nutch/trunk: ./ conf/ src/plugin/ src/plugin/urlfilter-domainblacklist/ src/plugin/urlfilter-domainblacklist/data/ src/plugin/urlfilter-domainblacklist/src/ src/plugin/urlfilter-domainblacklist/src/java/ src/plugin/urlf
[jira] [Commented] (NUTCH-1210) DomainBlacklistFilter
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
Re: svn commit: r1292764 - in /nutch/trunk: ./ conf/ src/plugin/ src/plugin/urlfilter-domainblacklist/ src/plugin/urlfilter-domainblacklist/data/ src/plugin/urlfilter-domainblacklist/src/ src/plugin/urlfilter-domainblacklist/src/java/ src/plugin/urlf
Re: I think I found a bug --> multiple_values_encountered_for_non_multiValued_field_title
[jira] [Resolved] (NUTCH-1210) DomainBlacklistFilter
Jenkins build is back to normal : Nutch-nutchgora #172
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
Build failed in Jenkins: Nutch-nutchgora #171
Re: slf4j-log4j12 new version causes runtime error
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
Build failed in Jenkins: Nutch-trunk #1766
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
Jenkins build is back to normal : Nutch-nutchgora #170
February 22, 2012
Re: slf4j-log4j12 new version causes runtime error
Re: slf4j-log4j12 new version causes runtime error
[jira] [Closed] (NUTCH-965) Skip parsing for truncated documents
[jira] [Resolved] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Updated] (NUTCH-965) Skip parsing for truncated documents
Jenkins build is back to normal : nutch-trunk-maven #161
Build failed in Jenkins: Nutch-nutchgora #169
I think I found a bug --> multiple_values_encountered_for_non_multiValued_field_title
February 21, 2012
slf4j-log4j12 new version causes runtime error
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-1281) tika parser not work properly with unwanted file types that passed from filters in nutch
[jira] [Resolved] (NUTCH-1288) Generator should not generate filter and not found and denied and gone and permanently moved pages
[jira] [Updated] (NUTCH-1288) Generator should not generate filter and not found and denied and gone and permanently moved pages
[jira] [Created] (NUTCH-1288) Generator should not generate filter and not found and denied and gone and permanently moved pages
Build failed in Jenkins: nutch-trunk-maven #160
[jira] [Commented] (NUTCH-1280) language-identifier should have option to use detected value by Tika even when uncertain
[jira] [Commented] (NUTCH-1287) Upgrade to hsqldb 2.2.8
February 20, 2012
Re: [DISCUSS] Nutchgora 2.0 release
[jira] [Updated] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)
Re: [DISCUSS] Nutchgora 2.0 release
[jira] [Closed] (NUTCH-1287) Upgrade to hsqldb 2.2.8
Jenkins build is back to normal : nutch-trunk-maven #159
Re: [DISCUSS] Nutchgora 2.0 release
[jira] [Created] (NUTCH-1287) Upgrade to hsqldb 2.2.8
[jira] [Reopened] (NUTCH-1277) Fix [fallthrough] javac warnings
Re: Build failed in Jenkins: nutch-trunk-maven #158
Re: Build failed in Jenkins: nutch-trunk-maven #158
Build failed in Jenkins: nutch-trunk-maven #158
Re: [DISCUSS] Nutchgora 2.0 release
[jira] [Created] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)
[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents
[jira] [Closed] (NUTCH-1277) Fix [fallthrough] javac warnings
[jira] [Resolved] (NUTCH-1277) Fix [fallthrough] javac warnings
[jira] [Updated] (NUTCH-1285) Debian Packaging for Nutch
[jira] [Created] (NUTCH-1285) Debian Packaging for Nutch
[jira] [Updated] (NUTCH-1283) Radically update all Solr configuration in Nutchgora
[jira] [Issue Comment Edited] (NUTCH-1053) Parsing of RSS feeds fails
[jira] [Updated] (NUTCH-1053) Parsing of RSS feeds fails
[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory
[jira] [Resolved] (NUTCH-1280) language-identifier should have option to use detected value by Tika even when uncertain
[jira] [Commented] (NUTCH-1283) Ridically update all Solr configuration in Nutchgora
February 19, 2012
[jira] [Commented] (NUTCH-809) Parse-metatags plugin
[jira] [Commented] (NUTCH-1276) Fix [dep-ann]
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-1276) Fix [dep-ann]
[jira] [Commented] (NUTCH-1276) Fix [dep-ann]
[jira] [Created] (NUTCH-1284) Add site fetcher.max.crawl.delay as log output by default.
[jira] [Updated] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.
[jira] [Closed] (NUTCH-1271) Fix errors @ compile time
[jira] [Resolved] (NUTCH-1271) Fix errors @ compile time
[jira] [Assigned] (NUTCH-1249) Resolve all issues flagged up by adding javac -Xlint arguement
[jira] [Commented] (NUTCH-1273) Fix [deprecation] javac warnings
[jira] [Resolved] (NUTCH-1276) Fix [dep-ann]
[jira] [Closed] (NUTCH-1276) Fix [dep-ann]
[jira] [Updated] (NUTCH-728) Improve nutch release packaging
[jira] [Updated] (NUTCH-1253) Incompatible neko and xerces versions
[jira] [Updated] (NUTCH-1253) Incompatible neko and xerces versions
[jira] [Commented] (NUTCH-929) Create a REST-based admin UI for Nutch
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "NutchAdministrationUserInterface" by LewisJohnMcgibbney
[jira] [Created] (NUTCH-1283) Ridically update all Solr configuration in Nutchgora
[jira] [Commented] (NUTCH-1278) Fetch Improvement in threads per host
[jira] [Commented] (NUTCH-1281) tika parser not work properly with unwanted file types that passed from filters in nutch
[jira] [Commented] (NUTCH-1281) tika parser not work properly with unwanted file types that passed from filters in nutch
[jira] [Created] (NUTCH-1282) linkdb scalability
[jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0
[jira] [Commented] (NUTCH-1278) Fetch Improvement in threads per host
[jira] [Updated] (NUTCH-1278) Fetch Improvement in threads per host
Page(s): 1 |
2
|
3
of 3
(C)2011 mailinglist-archive.com