Gem Nokogiri 获取 td 里面的内容或者某个元素后面的内容

luffycn · October 07, 2014 · Last by luojie2017 replied at December 22, 2016 · 2962 hits

如果有这么一段 html

text

....

用这段代码 nodeset = content.css('td[@width="70%"]') data = nodeset[0].to_s

输出的 data 会包含有 td 标签本身如何去掉 td 了

另外，如何获取 h3 标签后面的所有 p 内容了

gihnius #0 October 07, 2014

用 xpath Nokogiri::HTML(html).xpath('//td').text Nokogiri::HTML(html).xpath('//td/h3').text Nokogiri::HTML(html).xpath('//td/p').text

1 likes

sitoto #1 October 07, 2014

inner_html

xx_lee #2 October 08, 2014

html = Nokogiri::HTML('<td width="70%"><h3>text</h3><p>....</p><p>....</p><p>....</p></td>')
puts html.css("td").children # 或inner_html
puts html.css("td p").text

luojie2017 #3 December 22, 2016

请问：如何抓取当前页面的页面信息？

#3 楼 @xx_lee

You need to Sign in before reply, if you don't have an account, please Sign up first.

Gem Nokogiri 获取 td 里面的内容 或者某个元素后面的内容

Gem Nokogiri 获取 td 里面的内容 或者某个元素后面的内容

text

Gem Nokogiri 获取 td 里面的内容或者某个元素后面的内容

Gem Nokogiri 获取 td 里面的内容或者某个元素后面的内容