Get the text inside a <b> tag

Question

I have a string like this:

 $texto = "cualquiercosa<b>contenido</b>cualquiercosa";
 $resultado = preg_split("<b></b>", $texto);

However, the result does not contain the text between the tags. Could you help me with what the regular expression should look like?

php regex

asked by Islam Linarez 04.10.2017 at 22:32

1 answer

score 2 · Accepted Answer

You should not use regular expressions to process HTML. A small change in the HTML is enough to make your regex fail: an extra space, a different attribute on the tag, a comment, or a more complex structure would break even a huge regex.
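To illustrate that fragility, here is a minimal sketch. The pattern and the two extra inputs are my own assumptions, not part of the question. Note that preg_split splits a string around its matches, so preg_match with a capture group is the closer fit for extracting the text. Even so, a naive pattern only handles the exact markup it was written for:

```php
<?php
// Naive pattern (an illustration, not taken from the question):
// captures whatever sits between a literal <b> and the next </b>.
$pattern = '/<b>(.*?)<\/b>/';

// The first input is the one from the question; the other two are
// hypothetical variations of the same markup.
$inputs = [
    'simple'    => 'cualquiercosa<b>contenido</b>cualquiercosa',
    'attribute' => 'cualquiercosa<b class="x">contenido</b>cualquiercosa',
    'uppercase' => 'cualquiercosa<B>contenido</B>cualquiercosa',
];

$found = [];
foreach ($inputs as $name => $html) {
    // preg_match returns 1 on a match and 0 when nothing matches
    $found[$name] = preg_match($pattern, $html, $m) === 1 ? $m[1] : null;
}

var_dump($found);
```

Only the 'simple' input yields a captured string; the attribute and uppercase variants do not match at all. Each variation can be patched into the pattern, but the list of cases keeps growing.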

Processing HTML with the DOM is very easy, and it is the tool designed for that job.

The DOM is built as follows:

$html = 'cualquiercosa<b>contenido</b>cualquiercosa';

// Build the DOM
$dom = new DOMDocument;
$dom->loadHTML($html, LIBXML_COMPACT | LIBXML_HTML_NOIMPLIED | LIBXML_NONET);

And we can get all the <b> elements:

// Get all the <b> tags
$b_nodelist = $dom->getElementsByTagName('b');

Then iterate over the list of results, getting the text inside each tag (without the tags themselves):

// Loop over each <b>
foreach ($b_nodelist as $b) {

    // Get the text content of the tag
    $texto = $b->textContent;

    echo "\n\nContenido del B:\n" . $texto;    // => contenido
}

Result:

Contenido del B:
contenido
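The steps above can be gathered into one reusable function. This is only a sketch: the function name textOfBoldTags and the sample input are hypothetical, and the libxml_use_internal_errors calls are my own addition to keep parser warnings for malformed markup out of the output. The loadHTML flags are copied from the answer.

```php
<?php
// Hypothetical helper (the name is not from the answer): returns the
// text content of every <b> element found in an HTML fragment.
function textOfBoldTags(string $html): array
{
    // Collect libxml parse warnings internally instead of emitting them
    $previous = libxml_use_internal_errors(true);

    $dom = new DOMDocument;
    $dom->loadHTML($html, LIBXML_COMPACT | LIBXML_HTML_NOIMPLIED | LIBXML_NONET);

    // Discard collected warnings and restore the previous error handling
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $texts = [];
    foreach ($dom->getElementsByTagName('b') as $b) {
        $texts[] = $b->textContent;
    }
    return $texts;
}

// Sample input (an assumption) with an attribute and an uppercase tag
$result = textOfBoldTags('cualquiercosa<b class="x">uno</b> y <B>dos</B>');
print_r($result);
```

Because the HTML parser normalizes tag names and ignores attributes when matching by tag name, both elements in the sample input should be found without any change to the code.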

Demo:

See the 3v4l.org demo