How to collect all the image tags from an html page on RUBY

Question

How to collect all the image tags from an html page on RUBY

Navigation

#1 by (2 votes)

2

My program collects all redirection addresses but I could not do the same with the image tags

    require 'nokogiri'
    require 'net/http'

    pagina  = Net::HTTP.get(ARGV[0],ARGV[1])
    enlaces = Nokogiri::HTML(pagina).xpath('//a[@href]').map { |link| link['href'] }
    imagenes = Nokogiri::HTML(pagina).xpath('//img/src').map { |link| link['src'] }
    puts "Los enlaces son: "
    puts enlaces
    puts "Las imagenes son: "
    puts imagenes

html ruby

asked by Felipe Olaya Ospina 06.11.2017 в 13:55

source

1 answer

How to do frequency SQL queries per hour and per day? Extract the certification chain in c #

score 2 · Accepted Answer

If the page you process is normal HTML, then the problem is that you are trying to read the src tag inside the img tag, when what you want to read is the src attribute of the% tag img (something like what you do with the href of the a tags).

Then you just have to change to //img[@src] in this line of code:

imagenes = Nokogiri::HTML(pagina).xpath('//img[@src]').map { |link| link['src'] }