2025-12-26 08:03:00 [scrapy.utils.log] INFO: Scrapy 2.8.0 started (bot: modularspiders) 2025-12-26 08:03:00 [scrapy.utils.log] INFO: Versions: lxml 4.9.2.0, libxml2 2.9.14, cssselect 1.2.0, parsel 1.7.0, w3lib 2.1.1, Twisted 22.10.0, Python 3.10.15 (main, Oct 19 2024, 04:00:28) [GCC 12.2.0], pyOpenSSL 23.1.1 (OpenSSL 3.1.0 14 Mar 2023), cryptography 40.0.1, Platform Linux-5.10.0-21-amd64-x86_64-with-glibc2.36 2025-12-26 08:03:00 [scrapy.crawler] INFO: Overridden settings: {'BOT_NAME': 'modularspiders', 'CLOSESPIDER_TIMEOUT': 3600, 'LOG_FILE': 'logs/default/nerdaudio/ea296540e23011f097600242ac11000b.log', 'NEWSPIDER_MODULE': 'modularspiders.spiders', 'ROBOTSTXT_OBEY': True, 'SPIDER_MODULES': ['modularspiders.spiders'], 'USER_AGENT': 'Mozilla/5.0 (compatible; Googlebot/2.1; ' '+http://www.google.com/bot.html)'} 2025-12-26 08:03:00 [py.warnings] WARNING: /code/lib/python3.10/site-packages/scrapy/utils/request.py:232: ScrapyDeprecationWarning: '2.6' is a deprecated value for the 'REQUEST_FINGERPRINTER_IMPLEMENTATION' setting. It is also the default value. In other words, it is normal to get this warning if you have not defined a value for the 'REQUEST_FINGERPRINTER_IMPLEMENTATION' setting. This is so for backward compatibility reasons, but it will change in a future version of Scrapy. See the documentation of the 'REQUEST_FINGERPRINTER_IMPLEMENTATION' setting for information on how to handle this deprecation. return cls(crawler) 2025-12-26 08:03:00 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.epollreactor.EPollReactor 2025-12-26 08:03:00 [scrapy.extensions.telnet] INFO: Telnet Password: aef328ca9b1cdbd1 2025-12-26 08:03:00 [scrapy.middleware] INFO: Enabled extensions: ['scrapy.extensions.corestats.CoreStats', 'scrapy.extensions.telnet.TelnetConsole', 'scrapy.extensions.memusage.MemoryUsage', 'scrapy.extensions.logstats.LogStats', 'scrapy.extensions.closespider.CloseSpider'] 2025-12-26 08:03:00 [scrapy.middleware] INFO: Enabled downloader middlewares: ['scrapy.downloadermiddlewares.robotstxt.RobotsTxtMiddleware', 'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware', 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware', 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware', 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware', 'scrapy.downloadermiddlewares.retry.RetryMiddleware', 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware', 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware', 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware', 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware', 'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware', 'scrapy.downloadermiddlewares.stats.DownloaderStats'] 2025-12-26 08:03:00 [scrapy.middleware] INFO: Enabled spider middlewares: ['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware', 'scrapy.spidermiddlewares.offsite.OffsiteMiddleware', 'scrapy.spidermiddlewares.referer.RefererMiddleware', 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware', 'scrapy.spidermiddlewares.depth.DepthMiddleware'] 2025-12-26 08:03:00 [scrapy.middleware] INFO: Enabled item pipelines: ['modularspiders.pipelines.MinimumViableItemPipeline', 'modularspiders.pipelines.RedisExporter', 'modularspiders.pipelines.BulkStoreUpdate'] 2025-12-26 08:03:00 [scrapy.core.engine] INFO: Spider opened 2025-12-26 08:03:00 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min) 2025-12-26 08:03:00 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6034 2025-12-26 08:03:01 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-12-26 08:03:02 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-12-26 08:03:02 [scrapy.dupefilters] DEBUG: Filtered duplicate request: - no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates) 2025-12-26 08:03:02 [nerdaudio] INFO: Found next: https://nerdaudio.com/collections/modular_page-2 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular) 2025-12-26 08:03:03 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/modular) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-12-26 08:03:03 [nerdaudio] INFO: Found next: https://nerdaudio.com/collections/used-gear_page-2 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/used-gear) 2025-12-26 08:03:03 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/used-gear) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular) 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/used-gear) 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular) 2025-12-26 08:03:03 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular) 2025-12-26 08:03:03 [nerdaudio] INFO: Found next: https://nerdaudio.com/collections/modular_page-3 2025-12-26 08:03:03 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/used-gear) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:03 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/modular) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:03 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/modular) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/used-gear) 2025-12-26 08:03:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular_page-2) 2025-12-26 08:03:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular_page-2) 2025-12-26 08:03:04 [nerdaudio] INFO: Found next: https://nerdaudio.com/collections/modular_page-4 2025-12-26 08:03:04 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/modular_page-2) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular) 2025-12-26 08:03:04 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/modular) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular_page-3) 2025-12-26 08:03:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://nerdaudio.com/collections/modular_page-3) 2025-12-26 08:03:05 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://nerdaudio.com/collections/modular_page-3) Traceback (most recent call last): File "/code/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks current.result = callback( # type: ignore[misc] File "/code/modularspiders/spiders/nerdaudio.py", line 36, in parse_product if has_canonical_mismatch(response): NameError: name 'has_canonical_mismatch' is not defined 2025-12-26 08:03:05 [scrapy.core.engine] INFO: Closing spider (finished) 2025-12-26 08:03:05 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): wigglehunt.com:443 2025-12-26 08:03:05 [py.warnings] WARNING: /code/lib/python3.10/site-packages/urllib3/connectionpool.py:1045: InsecureRequestWarning: Unverified HTTPS request is being made to host 'wigglehunt.com'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#ssl-warnings warnings.warn( 2025-12-26 08:03:05 [urllib3.connectionpool] DEBUG: https://wigglehunt.com:443 "GET /api/v1/_ingest-complete?store_code=nerdaudio&batch_id=1887 HTTP/1.1" 200 15 2025-12-26 08:03:05 [scrapy.statscollectors] INFO: Dumping Scrapy stats: {'downloader/request_bytes': 5017, 'downloader/request_count': 15, 'downloader/request_method_count/GET': 15, 'downloader/response_bytes': 360889, 'downloader/response_count': 15, 'downloader/response_status_count/200': 15, 'dupefilter/filtered': 72, 'elapsed_time_seconds': 4.540695, 'finish_reason': 'finished', 'finish_time': datetime.datetime(2025, 12, 26, 8, 3, 5, 418150), 'httpcompression/response_bytes': 1481378, 'httpcompression/response_count': 14, 'log_count/DEBUG': 19, 'log_count/ERROR': 8, 'log_count/INFO': 14, 'log_count/WARNING': 2, 'memusage/max': 77066240, 'memusage/startup': 77066240, 'request_depth_max': 3, 'response_received_count': 15, 'robotstxt/request_count': 1, 'robotstxt/response_count': 1, 'robotstxt/response_status_count/200': 1, 'scheduler/dequeued': 14, 'scheduler/dequeued/memory': 14, 'scheduler/enqueued': 14, 'scheduler/enqueued/memory': 14, 'spider_exceptions/NameError': 8, 'start_time': datetime.datetime(2025, 12, 26, 8, 3, 0, 877455)} 2025-12-26 08:03:05 [scrapy.core.engine] INFO: Spider closed (finished)