Updates

  • The use of phantomjs has been deprecated. Check Wikipedia for more info.
  • The updated code for Safari webriver is under development as starting Safari 10 on OS X El Capitan and macOS Sierra, Safari comes bundled with a new driver implementation.
  • Opera is now based on ChromeDriver and has been adapted by Opera that enables programmatic automation of Chromium-based Opera products but hasn't been updated since April 5, 2017. We have suspended its support and removed from the code.(link)
  • Headless mode has been added for Chrome and Firefox. Set selenium.enable.headless to true in nutch-default.xml or nutch-site.xml to use it.

Your can run Nutch in Docker. Check some examples at https://github.com/sbatururimi/nutch-test. Don't forget to update Dockefile to point to the original Nutch repository when updated.

Contributors

Stas Batururimi [s.batururimi@gmail.com]