注册 登录
编程论坛 Python论坛

求助贴,python3中scrapy爬虫错误

HC__smile 发布于 2019-05-08 17:15, 1319 次点击
import scrapy
from douban_book.items import DoubanBookItem
 
 
class BookSpider(scrapy.Spider):
    """docstring for BookSpider"""
    name = 'douban-book'
    allowed_domain = ['']
    start_urls = ['https://book.']
 
    def parse(self, response):
        yield scrapy.Request(response.url, callback = self.parse_page)
 
        for page in response.xpath('//div[@class="paginator"]/a'):
            link = page.xpath('@href').extract()[0]
            yield scrapy.Request(link, callback = self.parse_page)
 
    def parse_page(self, response):
        for item in response.xpath('//tr[@class="item"]'):
            book = DoubanBookItem()
            book['name'] = item.xpath('td[2]/div[1]/a/@title').extract()[0]
            book['ratings'] =item.xpath('td[2]/div[2]/span[@class="rating_nums"]/text()').extract()[0]
            # book['ratings'] = item.xpath('td[2]/div[2]/span[2]/text()').extract()[0]
            book_info = item.xpath('td[2]/p[1]/text()').extract()[0]
            book_info_contents = book_info.strip().split(' / ')
            book['author'] = book_info_contents[0]
            book['publisher'] = book_info_contents[1]
            book['edition_year'] = book_info_contents[2]
            book['price'] = book_info_contents[3]
            yield book
错误:C:\anacoda\python.exe D:/scrapy/douban_book/douban_book/spiders/bookspider.py.py
Traceback (most recent call last):
  File "D:/scrapy/douban_book/douban_book/spiders/bookspider.py.py", line 6, in <module>
    from douban_book.items import DoubanBookItem
ModuleNotFoundError: No module named 'douban_book'
这是什么错误呀,谢谢!
1 回复
#2
wp2319572019-05-09 07:47
没有douban_book这个模块,多清晰的提示啊
1