How to change configuration of apache nutch when it is crawling

My crawler (apache nutch2.2.1) is in crawling state. I have to change some configurations of crawler in nutch-site.xml. I have come to know that when crawler is in running state, avoid to change configuration.

My question is.

Can we change configurations of crawler in running state?
If yes then is there any cations when doing some changes in crawler?
or If we could not change configuration of crawler, then what are its drawbacks if configurations are changed?

Solution

Nutch 2.2.1 crawling is a loop of Hadoop jobs, we can change the configuration of the Nutch crawler during runtime, however the changing only is activated in the next Hadoop job. For example, if you change the configuration during generating job, the changing is activated in fetching job.

Hope this helps,

Le Quoc Do

How to block a specific user agent in Apache
Allow access to page only from certain referrer
How can I solve "Error: MySQL shutdown unexpectedly"?
How to enable .htaccess support with PHP-FPM, Nginx, and Apache2 on Ubuntu 24.04 with FastPanel?
Wordpress REST API (wp-api) 404 Error: Cannot access the WordPress REST API
Apache POI autoSizeColumn Resizes Incorrectly
How to simulate DDOS/Slashdotting?
How to do load testing of a PHP website from both ends
Testing Apache/mod_jk/Tomcat configuration upgrade
How to solve "Error: Apache shutdown unexpectedly"?
“[notice] child pid XXXX exit signal Segmentation fault (11)” in apache error.log
Apache giving 403 forbidden errors
NoClassDefFoundError: org/apache/commons/lang3/StringUtils
I want to remove index.php in the link
How to get Apache Server to display test page
Reopening a session in PHP
How can I make sure AuthName works in all browsers?
map subfolder content to root with .htaccess
Benefits of ServerAdmin Directive in Apache2
Apache is not running from XAMPP Control Panel ( Error: Apache shutdown unexpectedly. This may be due to a blocked port)
Why can't my mod_wsgi module find "libpython3.7m.so.1.0" even though it exists?
index.php not loading by default
Fatal error: Class 'imagick' not found
Redirect HTTP to HTTPS on default virtual host without ServerName
How to get a pandas dataframe from an apache log?
apache2 service Failed on restart - Failed to start apache2.service: Unit not found
Unable to establish SSL connection, how do I fix my SSL cert?
Cannot resolve symbol 'IOUtils'
phpMyAdmin in Xampp not working
Leaflet.js - can't get my current location on some browsers