Python 爬蟲
資工大四 楊翔鈞
2020.05.07
requests
beautifulsoup4
. . .
. . .
body
div
div
table
span
div
class="ooxx"
id="xxoo"
class="abc"
id="div2"
class="abc"
id="div1"
lxml
apt-get install python-lxml
easy_install lxml
pip install lxml
re
就 regular expression
scrapy
requests 常用 method
beautifulsoup4 常用 method
CSS selector
beautifulsoup4 常用 method
node.decompose()
node
參考資源
DEMO :))))