当前位置: 首页 > news >正文

【Python爬虫】网页抓取实例之淘宝商品信息抓取

之前我们已经说过网页抓取的相关内容

上次我们是以亚马逊某网页的产品为例

抓取价格、品牌、型号、样式等

图片

该网页上价格、品牌、型号、样式等

都只有一个

如果网页上的目标内容

根据不同规格有多个

又该怎么提取呢?

▼如下图所示

当机身颜色、套餐、存储容量等

规格不同时

对应价格也不同

那该怎么获取不同属性下的价格呢?

 此时需要用到抓取商品详情信息的API接口item_get、或者item_get_sku。

详细调用步骤如下:

1、获取API调用权限,注册key和密钥。

2、进入API测试页,输入商品id,调用API。如下图:

3、找到响应示例中的sku信息

"skus": {"sku": [{"price": 678,"total_price": 0,"orginal_price": 678,"properties": "1627207:1007902496;5919063:6536025;12304035:1905146457","properties_name": "1627207:1007902496:机身颜色:幻夜黑;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1905146457:存储容量:6+128GB","quantity": 200,"sku_id": "5142540523707"},{"price": 778,"total_price": 0,"orginal_price": 778,"properties": "1627207:1007902496;5919063:6536025;12304035:1963543205","properties_name": "1627207:1007902496:机身颜色:幻夜黑;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5142540523708"},{"price": 868,"total_price": 0,"orginal_price": 868,"properties": "1627207:1007902496;5919063:3266779;12304035:1905146457","properties_name": "1627207:1007902496:机身颜色:幻夜黑;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1905146457:存储容量:6+128GB","quantity": 0,"sku_id": "5141622086982"},{"price": 878,"total_price": 0,"orginal_price": 878,"properties": "1627207:1007902496;5919063:3266779;12304035:1963543205","properties_name": "1627207:1007902496:机身颜色:幻夜黑;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5141622086983"},{"price": 678,"total_price": 0,"orginal_price": 678,"properties": "1627207:2771458827;5919063:6536025;12304035:1905146457","properties_name": "1627207:2771458827:机身颜色:苏音蓝;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1905146457:存储容量:6+128GB","quantity": 200,"sku_id": "5142540523709"},{"price": 778,"total_price": 0,"orginal_price": 778,"properties": "1627207:2771458827;5919063:6536025;12304035:1963543205","properties_name": "1627207:2771458827:机身颜色:苏音蓝;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5142540523710"},{"price": 868,"total_price": 0,"orginal_price": 868,"properties": "1627207:2771458827;5919063:3266779;12304035:1905146457","properties_name": "1627207:2771458827:机身颜色:苏音蓝;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1905146457:存储容量:6+128GB","quantity": 0,"sku_id": "5141622086986"},{"price": 878,"total_price": 0,"orginal_price": 878,"properties": "1627207:2771458827;5919063:3266779;12304035:1963543205","properties_name": "1627207:2771458827:机身颜色:苏音蓝;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5141622086987"},{"price": 678,"total_price": 0,"orginal_price": 678,"properties": "1627207:1198972384;5919063:6536025;12304035:1905146457","properties_name": "1627207:1198972384:机身颜色:蜜语红;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1905146457:存储容量:6+128GB","quantity": 200,"sku_id": "5142540523705"},{"price": 778,"total_price": 0,"orginal_price": 778,"properties": "1627207:1198972384;5919063:6536025;12304035:1963543205","properties_name": "1627207:1198972384:机身颜色:蜜语红;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5142540523706"},{"price": 868,"total_price": 0,"orginal_price": 868,"properties": "1627207:1198972384;5919063:3266779;12304035:1905146457","properties_name": "1627207:1198972384:机身颜色:蜜语红;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1905146457:存储容量:6+128GB","quantity": 0,"sku_id": "5141622086984"},{"price": 878,"total_price": 0,"orginal_price": 878,"properties": "1627207:1198972384;5919063:3266779;12304035:1963543205","properties_name": "1627207:1198972384:机身颜色:蜜语红;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5141622086985"},{"price": 678,"total_price": 0,"orginal_price": 678,"properties": "1627207:5860482491;5919063:6536025;12304035:1905146457","properties_name": "1627207:5860482491:机身颜色:烟雨青蓝;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1905146457:存储容量:6+128GB","quantity": 200,"sku_id": "5142540523711"},{"price": 778,"total_price": 0,"orginal_price": 778,"properties": "1627207:5860482491;5919063:6536025;12304035:1963543205","properties_name": "1627207:5860482491:机身颜色:烟雨青蓝;5919063:6536025:套餐类型:官方标配 华为nova5i;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5142540523712"},{"price": 868,"total_price": 0,"orginal_price": 868,"properties": "1627207:5860482491;5919063:3266779;12304035:1905146457","properties_name": "1627207:5860482491:机身颜色:烟雨青蓝;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1905146457:存储容量:6+128GB","quantity": 0,"sku_id": "5141622086988"},{"price": 878,"total_price": 0,"orginal_price": 878,"properties": "1627207:5860482491;5919063:3266779;12304035:1963543205","properties_name": "1627207:5860482491:机身颜色:烟雨青蓝;5919063:3266779:套餐类型:套餐一 华为nova5por;12304035:1963543205:存储容量:8+128GB","quantity": 200,"sku_id": "5141622086989"}]},

 

相关文章:

  • 北京网站建设多少钱?
  • 辽宁网页制作哪家好_网站建设
  • 高端品牌网站建设_汉中网站制作
  • 探索未来,开启无限可能:打造智慧应用,亚马逊云科技大语言模型助您一臂之力
  • Golang Gorm 连接数据库
  • LogBack的日志报错解决办法 org.xml.sax.SAXNotRecognizedException
  • 黑客(网络安全)技术——高效自学1.0
  • Qt中对Udp数据打包发送和接收
  • 已知CSIDL常量值,求其对应路径。
  • ImportError: cannot import name ‘HTTPClientFactory‘ from ‘twisted.web.client‘
  • k8s、数据存储
  • 【数据结构】Lambda
  • 【C++】单例模式
  • 9 mysql调优
  • 大数据毕业设计选题推荐-智慧消防大数据平台-Hadoop-Spark-Hive
  • 带你走进Cflow (三)·控制符号类型分析
  • el-table 多表格弹窗嵌套数据显示异常错乱问题
  • asp.net core mvc之 视图
  • IE9 : DOM Exception: INVALID_CHARACTER_ERR (5)
  • Angular6错误 Service: No provider for Renderer2
  • E-HPC支持多队列管理和自动伸缩
  • isset在php5.6-和php7.0+的一些差异
  • Java Agent 学习笔记
  • JavaScript 一些 DOM 的知识点
  • javascript数组去重/查找/插入/删除
  • JS 面试题总结
  • JS数组方法汇总
  • mysql中InnoDB引擎中页的概念
  • opencv python Meanshift 和 Camshift
  • React-生命周期杂记
  • spring + angular 实现导出excel
  • UMLCHINA 首席专家潘加宇鼎力推荐
  • 大快搜索数据爬虫技术实例安装教学篇
  • 大数据与云计算学习:数据分析(二)
  • 番外篇1:在Windows环境下安装JDK
  • 关于 Linux 进程的 UID、EUID、GID 和 EGID
  • 技术胖1-4季视频复习— (看视频笔记)
  • 开放才能进步!Angular和Wijmo一起走过的日子
  • 理解在java “”i=i++;”所发生的事情
  • 马上搞懂 GeoJSON
  • 我看到的前端
  • ​​​【收录 Hello 算法】9.4 小结
  • ​queue --- 一个同步的队列类​
  • #if #elif #endif
  • $.proxy和$.extend
  • (android 地图实战开发)3 在地图上显示当前位置和自定义银行位置
  • (react踩过的坑)Antd Select(设置了labelInValue)在FormItem中initialValue的问题
  • (八)Flink Join 连接
  • (二)丶RabbitMQ的六大核心
  • (附源码)计算机毕业设计SSM基于java的云顶博客系统
  • (三) prometheus + grafana + alertmanager 配置Redis监控
  • (终章)[图像识别]13.OpenCV案例 自定义训练集分类器物体检测
  • (转)用.Net的File控件上传文件的解决方案
  • (转载)PyTorch代码规范最佳实践和样式指南
  • .jks文件(JAVA KeyStore)
  • .Net Core中的内存缓存实现——Redis及MemoryCache(2个可选)方案的实现
  • .net mvc部分视图
  • .NET 程序如何获取图片的宽高(框架自带多种方法的不同性能)