Re: [dotcms] Can't index the Site Search of dotCMS 1.9.5

44 views
Skip to first unread message

Maria Ahues Bouza

unread,
Jul 25, 2012, 1:29:09 PM7/25/12
to dot...@googlegroups.com
You're running dotCMS with 1GB of memory and we recommend running with at least 4GB of memory
http://dotcms.com/products/technology/hardware-specs.dot

If you get more memory on the server and up the memory in the startup file it should run properly.




On Wed, Jul 25, 2012 at 6:03 AM, <maxat...@gmail.com> wrote:
Hello,

I get these errors when I want to index the site for the Site Search. My settings in the startup.bat are these:
set JAVA_OPTS=-Djava.awt.headless=true -Xverify:none -Dfile.encoding=UTF8 -server -Xms256M -Xmx1024M -XX:PermSize=256m

So that should be enough I think. When I set the -Xmx bigger the program doesn't start anymore.

java.io.IOException: Job failed!
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1252)
    at org.apache.nutch.crawl.Injector.inject(Injector.java:211)
    at com.dotmarketing.sitesearch.CrawlerUtil.indexHost(CrawlerUtil.java:292)
    at com.dotmarketing.sitesearch.CrawlerUtil.index(CrawlerUtil.java:162)
    at com.dotmarketing.sitesearch.job.SiteSearchJobImpl.run(SiteSearchJobImpl.java:102)
    at com.dotmarketing.sitesearch.job.SiteSearchJobProxy.run(SiteSearchJobProxy.java:16)
    at com.dotmarketing.quartz.DotJob.execute(DotJob.java:42)
    at org.quartz.core.JobRunShell.run(JobRunShell.java:202)
    at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:529)
[25/07/12 14:45:18:513 CEST] ERROR sitesearch.CrawlerUtil: Error indexing host localhost caused by = com.dotmarketing.exception.DotRuntimeException: Job failed!
[25/07/12 14:45:18:534 CEST]  INFO sitesearch.CrawlerUtil: indexHost : Starting an Site Search index on host localhost
[25/07/12 14:45:18:534 CEST]  INFO util.AdminLogger: class com.dotmarketing.sitesearch.CrawlerUtil : indexHost : Starting an Site Search index on host localhost
[25/07/12 14:45:18:537 CEST]  INFO sitesearch.CrawlerUtil: site search crawl started in: C:/dotCMS/dotCMS/assets/search_index/localhost/48190c8c-42c4-46af-8d1a-0cd5db894797_temp/crawl-index
[25/07/12 14:45:18:537 CEST]  INFO sitesearch.CrawlerUtil: rootUrlDir = C:/dotCMS/dotCMS/assets/search_index/localhost/url_folder
[25/07/12 14:45:18:537 CEST]  INFO sitesearch.CrawlerUtil: threads = 10
[25/07/12 14:45:18:537 CEST]  INFO sitesearch.CrawlerUtil: depth = 20
[25/07/12 14:45:18:537 CEST]  INFO sitesearch.CrawlerUtil: indexer=lucene



-------------

java.lang.OutOfMemoryError: Java heap space
[25/07/12 14:44:16:127 CEST]  INFO mapred.JobClient:  map 0% reduce 0%
[25/07/12 14:44:16:127 CEST]  INFO mapred.JobClient: Job complete: job_local_0022
[25/07/12 14:44:16:127 CEST]  INFO mapred.JobClient: Counters: 0
[25/07/12 14:44:16:162 CEST]  INFO crawl.Injector: Injector: starting
[25/07/12 14:44:16:162 CEST]  INFO crawl.Injector: Injector: crawlDb: C:/dotCMS/dotCMS/assets/search_index/localhost/48190c8c-42c4-46af-8d1a-0cd5db894797_temp/crawl-index/crawldb
[25/07/12 14:44:16:162 CEST]  INFO crawl.Injector: Injector: urlDir: C:/dotCMS/dotCMS/assets/search_index/localhost/url_folder
[25/07/12 14:44:16:162 CEST]  INFO crawl.Injector: Injector: Converting injected urls to crawl db entries.
[25/07/12 14:44:16:162 CEST]  INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already initialized
[25/07/12 14:44:16:163 CEST]  WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
[25/07/12 14:44:16:248 CEST]  INFO mapred.FileInputFormat: Total input paths to process : 1
[25/07/12 14:44:16:315 CEST]  INFO mapred.LocalJobRunner: file:/C:/dotCMS/dotCMS/assets/search_index/localhost/url_folder/urls.txt:0+18
[25/07/12 14:44:16:333 CEST]  INFO mapred.JobClient: Running job: job_local_0023
[25/07/12 14:44:16:334 CEST]  INFO mapred.FileInputFormat: Total input paths to process : 1
[25/07/12 14:44:16:353 CEST]  INFO mapred.MapTask: numReduceTasks: 1
[25/07/12 14:44:16:353 CEST]  INFO mapred.MapTask: io.sort.mb = 100
[25/07/12 14:44:16:932 CEST]  WARN mapred.LocalJobRunner: job_local_0023
java.lang.OutOfMemoryError: Java heap space
[25/07/12 14:44:17:333 CEST]  INFO mapred.JobClient:  map 0% reduce 0%
[25/07/12 14:44:17:333 CEST]  INFO mapred.JobClient: Job complete: job_local_0023
[25/07/12 14:44:17:333 CEST]  INFO mapred.JobClient: Counters: 0
[25/07/12 14:44:17:359 CEST]  INFO crawl.Injector: Injector: starting
[25/07/12 14:44:17:359 CEST]  INFO crawl.Injector: Injector: crawlDb: C:/dotCMS/dotCMS/assets/search_index/localhost/48190c8c-42c4-46af-8d1a-0cd5db894797_temp/crawl-index/crawldb
[25/07/12 14:44:17:359 CEST]  INFO crawl.Injector: Injector: urlDir: C:/dotCMS/dotCMS/assets/search_index/localhost/url_folder
[25/07/12 14:44:17:359 CEST]  INFO crawl.Injector: Injector: Converting injected urls to crawl db entries.
[25/07/12 14:44:17:360 CEST]  INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already initialized
[25/07/12 14:44:17:382 CEST]  WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
[25/07/12 14:44:17:447 CEST]  INFO mapred.FileInputFormat: Total input paths to process : 1
[25/07/12 14:44:17:508 CEST]  INFO mapred.LocalJobRunner: file:/C:/dotCMS/dotCMS/assets/search_index/localhost/url_folder/urls.txt:0+18
[25/07/12 14:44:17:536 CEST]  INFO mapred.JobClient: Running job: job_local_0024
[25/07/12 14:44:17:537 CEST]  INFO mapred.FileInputFormat: Total input paths to process : 1
[25/07/12 14:44:17:558 CEST]  INFO mapred.MapTask: numReduceTasks: 1

--
You received this message because you are subscribed to the Google Groups "dotCMS User Group" group.
To view this discussion on the web visit https://groups.google.com/d/msg/dotcms/-/-9Cmk3Jbw3QJ.
To post to this group, send email to dot...@googlegroups.com.
To unsubscribe from this group, send email to dotcms+un...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/dotcms?hl=en.



--
Community Manager

dotCMS
Main: 305.900.2001
Fax: 305.397.2579
www.dotcms.com
http://www.twitter.com/dotCMS
http://www.facebook.com/dotCMS
http://www.twitter.com/mabouza

Please consider the planet before printing this email.
Reply all
Reply to author
Forward
0 new messages