In Google's search results, the site's categories started appearing with links that are not human-readable URLs. For example, instead of https://life-thai.com/category/countries/malaysia/ Google shows https://life-thai.com/?cat=31. It is not clear where Google got these links from. Human-readable URLs are used everywhere on the site. I checked sitemap.xml, and there are no such links there either.

Interestingly, when you click https://life-thai.com/?cat=31 in the search results, the page with the URL https://life-thai.com/category/countries/malaysia/ opens. That is, there is a redirect. But no such redirect is configured in .htaccess.

The site runs on WordPress.

Google Webmaster Console reported a "Duplicate Title" error, which is understandable.

Question: how can I solve the problem without closing the categories from indexing?

  • 1
    The telepaths are all gone, and experts work with concrete data. They need the website address. - SeVlad
  • Here it is. - super
  • The address needs to be added to the question itself. And it has to be a real one: life-thai.com/?cat=987 does not exist. - SeVlad
  • @SeVlad Clarified, added the real URL. - super
  • 1
    Thanks for the answer. Human-readable URLs have been used since the site first appeared. But the categories were closed from indexing until recently. - super

1 answer

Close technical duplicates of pages from indexing, as well as pages whose content is duplicated in one form or another from other pages (calendars, archives, RSS):

    User-agent: Yandex
    Allow: /
    Disallow: /cgi-bin/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /wp-login.php
    Disallow: /wp-register.php
    Disallow: /xmlrpc.php
    Disallow: /path.php
    Disallow: /readme.html
    Disallow: /*/feed/
    Disallow: /?s=
    Disallow: /?p=
    Disallow: /?cat= # the answer to your question
    Crawl-delay: 2

    User-agent: GoogleBot
    Allow: /
    Disallow: /cgi-bin/
    Disallow: /wp-admin/
    Disallow: /wp-includes/
    Disallow: /wp-login.php
    Disallow: /wp-register.php
    Disallow: /xmlrpc.php
    Disallow: /path.php
    Disallow: /readme.html
    Disallow: /*/feed/
    Disallow: /?s=
    Disallow: /?p=
    Disallow: /?cat= # the answer to your question
    Crawl-delay: 2

    Host: site.com
    Sitemap: https://site.com/sitemap.xml

Learn more about writing a correct robots.txt here.
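To see which URLs such rules would actually block, here is a minimal Python sketch. It assumes the precedence that Google documents for robots.txt (the longest matching rule wins, and Allow wins a tie) and supports only the `*` and `$` wildcards. The names `RULES`, `pattern_to_regex` and `is_allowed` are illustrative and not part of any library, and the `/wp-includes/` script path is a hypothetical example.

```python
import re

# Rules from the GoogleBot group above, as (directive, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/path.php"),
    ("disallow", "/readme.html"),
    ("disallow", "/*/feed/"),
    ("disallow", "/?s="),
    ("disallow", "/?p="),
    ("disallow", "/?cat="),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` (path plus query string) may be crawled.

    Assumed precedence: the longest matching pattern wins; on equal
    length, an Allow rule beats a Disallow rule. No match means allowed.
    """
    best_len = -1
    best_allow = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for path in (
    "/?cat=31",
    "/category/countries/malaysia/",
    "/category/countries/malaysia/feed/",
    "/wp-includes/js/jquery/jquery.min.js",  # hypothetical script path
):
    print(path, "->", "allowed" if is_allowed(path, RULES) else "blocked")
```

Under these assumptions the `?cat=` duplicate and the feed are blocked while the human-readable category URL stays crawlable. The same check also shows that anything under /wp-includes/ is blocked, scripts included.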

P.S. I ask everyone who downvotes to explain properly in the comments why they think this option is wrong.

  • 1
    Without closing the categories from indexing. - super
  • Comments are not intended for extended discussion; conversation moved to chat. - PashaPash
  • 3
    With this robots.txt Google will complain that it cannot check the site for mobile-friendliness, among other things, because it will not be able to crawl the scripts. I had a similar configuration. Bad idea. - super
  • @superpantera My scripts are in the JS and CSS folders. If yours are in /wp-content/, allow indexing of that folder. Otherwise, the file only prohibits access to technical folders, for which the server will return 403 anyway. So what is the point of serving them and putting load on the server when you can serve only the content that will appear in the search results? - Vadizar