PHP Parser. How to make a function to extract text for a given link?

Question

I am writing a parser in PHP that should copy all publications from a site and display them on my own site (this is not content theft; I have the site owner's agreement).

I have already written code that copies the list of publications from the main page (title, photo, and short text). Now I need to parse the contents of each publication, so I started by collecting the links to all publications from the main page.

Now I need to write a function that parses the contents of each publication at these links.

Please show, with an example, how to parse the text behind each link!

    <?php
    header('Content-type: text/html; charset=utf-8');
    require 'phpQuery.php';

    function print_arr($arr){
        echo '<pre>' . print_r($arr, true) . '</pre>';
    }

    $url = 'http://lifemomentt.blogspot.com/';
    $file = file_get_contents($url);
    $doc = phpQuery::newDocument($file);

    foreach($doc->find('.blog-posts .post-outer .post') as $article){
        $article = pq($article);

        // parse the titles of all publications
        $text = $article->find('.entry-title a')->html();
        print_arr($text);

        // parse the links to all publications
        $texturl = $article->find('.entry-title a')->attr('href');
        echo $texturl;
    }
    ?>

I have the links to all publications, but I don't know how to parse the contents behind them (by contents I mean the page that opens when we click a link).

Yaroslav Molchan · Answer 1 · 2017-06-08T13:24:13

You are doing everything right. Just create a function that accepts the URL of an entry and runs the parser inside it. I have added it to your example for clarity:

    <?php
    header('Content-type: text/html; charset=utf-8');
    require 'phpQuery.php';

    // The function can be moved out to a separate file if desired
    function parseArticle($url){
        $file = file_get_contents($url);
        $doc = phpQuery::newDocument($file);
        // Parse here the same way as the list
    }

    function print_arr($arr){
        echo '<pre>' . print_r($arr, true) . '</pre>';
    }

    $url = 'http://lifemomentt.blogspot.com/';
    $file = file_get_contents($url);
    $doc = phpQuery::newDocument($file);

    foreach($doc->find('.blog-posts .post-outer .post') as $article){
        $article = pq($article);

        // parse the titles of all publications
        $text = $article->find('.entry-title a')->html();
        print_arr($text);

        // parse the links to all publications
        $texturl = $article->find('.entry-title a')->attr('href');
        parseArticle($texturl);
    }
    ?>
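The body of parseArticle is left as a placeholder above. As a rough sketch of one way it could be filled in, the version below uses PHP's built-in DOMDocument and DOMXPath instead of phpQuery so that it runs without extra libraries. The helper name extractArticleText and the class name post-body are assumptions, not something confirmed for this site: many Blogger templates wrap the post text in an element with that class, but check the actual markup of a publication page and adjust it.

```php
<?php
// Hypothetical helper: returns the text of the first element whose class
// attribute contains $class, or null if no such element exists.
// The default 'post-body' is an assumption about the Blogger template.
function extractArticleText(string $html, string $class = 'post-body'): ?string
{
    $dom = new DOMDocument();
    // Collect libxml warnings about imperfect real-world HTML instead of printing them.
    $previous = libxml_use_internal_errors(true);
    // The XML prolog is a common hint that makes loadHTML treat the input as UTF-8.
    $dom->loadHTML('<?xml encoding="UTF-8">' . $html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // XPath equivalent of the CSS selector ".post-body".
    $xpath = new DOMXPath($dom);
    $query = sprintf(
        '//*[contains(concat(" ", normalize-space(@class), " "), " %s ")]',
        $class
    );
    $nodes = $xpath->query($query);
    if ($nodes === false || $nodes->length === 0) {
        return null;
    }

    // Collapse runs of whitespace so the text is easy to print or store.
    return trim(preg_replace('/\s+/u', ' ', $nodes->item(0)->textContent));
}

// Sketch of parseArticle: downloads one publication and returns its text,
// or null if the download fails or the expected element is missing.
function parseArticle(string $url): ?string
{
    $html = @file_get_contents($url);
    if ($html === false) {
        return null;
    }
    return extractArticleText($html);
}
```

Inside the foreach loop, the result could then be printed with print_arr(parseArticle($texturl)). Keeping the download and the extraction in separate functions makes it possible to check the extraction against a saved HTML string without touching the network.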

Please show it using this site, lifemomentt.blogspot.com, as the example. I would be very grateful!
