-
hi
i log in my search console and saw a bunch of weird URLs under ‘noindex’ tag.
Info in “sensitive data:
-
Hello,
Thank you for contacting the support and sorry for any inconvenience that might have been caused due to that.
It certainly looks fishy as there’s no correlation between the website and query. I would recommend trying to run a malware scan and see if there’s any infected files are present.
Let us know how it goes.
Thank you.
hi i tried to run a malware scan but doesn’t seem to have any weird files..but it is showing that weird site URL.
there’s no correlation between the website and query. >> if you copy the full URL of that link i share..it is a page of some gambling text,etc.
Can refer to the snpoboard.io URL i shared in the sensitive data.
To add on..it shows “crawled but not indexed”..it is basically a live page if you click on that URL.
Can you check on that?
Hello,
In this case, you can follow the steps in this link:
https://rankmath.com/kb/internal-site-search-spam/You can also set a rule in your robots.txt to block those URLs from getting crawled.
Disallow: */search/*
Here’s a link you can follow:
https://rankmath.com/kb/how-to-edit-robots-txt-with-rank-math/Hope that helps.
Thank you.
Hello,
I have updated the sensitive data as requested. Can you please check further?
Thank you.
hi
1. I noticed that rank math (by default) will make it as no index. However, if you copy that full url (refer to the sensitive data) and paste it on chrome..it will show that page is still live (refer to image 1 and 3). Currently, prefers the page won’t be indexed..but page is still live.
How can we removed those pages?
2. I tried to edit the robot txt but it shows “Contents are locked because a robots.txt file is present in the root folder.” (refer to image 2). I follow the document as well. https://rankmath.com/kb/cant-edit-robots-txt/. I installed Really Simple SSL plugin, head over to Settings > Hardening and turn off the Disable the built-in file editors option. but it doesn’t help.
Would it be ok to share your my WP access to help on this?
Hello,
It looks like you are not the only one suffering from this hack.
We would recommend re-installing your theme and WP. You can reinstall WP from WordPress Dashboard > Updates.
Then, go to the root folder of your website through FTP and delete folders that match the URL pattern of these suspicious files.
You might also want to seek your web host’s assistance with this.
Finally, make sure your WP installation is secured again these issues by hardening your WP site: https://mythemeshop-com.webpkgcache.com/doc/-/s/mythemeshop.com/blog/wordpress-security-tips/
Hope that helps.
Hi
Thanks for your reply
1. How do we marked on these spam pages as “no indexed”? Currently there are about 2k of these pages are “Crawled – Currently Not Indexed” but “Excluded by 14k pages are ‘noindex’ tag”
In my rank math settings, it was already tick on the “Rank Math > Tiles & Meta > Misc Pages from your WordPress dashboard. Enable the Noindex Search Results option” previously.
2. I tried to edit the robot txt but it shows “Contents are locked because a robots.txt file is present in the root folder.” (refer to image 2). I follow the document as well. https://rankmath.com/kb/cant-edit-robots-txt/. I installed Really Simple SSL plugin, head over to Settings > Hardening and turn off the Disable the built-in file editors option. but it doesn’t help.
Hello,
#1 If you already applied the rules mentioned above to your robots.txt file, then you may wait for the next few crawls by Google to remove them from SERPs slowly. No additional actions will be required as long as Google is unable to crawl those URLs.
#2 Instead of using a plugin, you can simply remove the physical robots.txt file present on your server’s root directory to fix this one.
Hope that helps, and please do not hesitate to let us know if you need our assistance with anything else.
Thank you.
Hello,
I have updated the sensitive data as requested. Can you please check further?
Thank you.
Hello,
I can see the changes are reflected in your
robots.txt
generated by Rank Math at/?robots=1
Please remove the physical robots.txt file from your server’s root directory and clear your site, plugin, or any server-level cache to see if that works for you now.
Let us know how it goes. Looking forward to helping you.
Thank you.
Hi
1. I have added the code, however, it shows that spam page is still indexable. (refer to image 4 in the sensitve data). Those pages are somewhat live on the sites. I am not sure how we can have those spam pages removed. This is because those pages are not marked as suspicious files.
2. I read this post that you shared (https://rankmath.com/kb/internal-site-search-spam/). Previously before this spam happened, I already have this part “Rank Math > Tiles & Meta > Misc Pages from your WordPress dashboard” and Enable the Noindex Search Results option” previously.
However, in my search console data, i saw 14k pages are “excluded by ‘no index tag’.
But 2k of these pages managed to be “crawled – currently not indexed). One of those pages are the link i shared under the sensitive data.
So how do we settled or removed these 2k plus pages crawled but not indexed? Can you suggest what’s the best way to go about this?
Thanks alot
hi
i removed it and clear cache..
it is still showing the same.
That spam URL is still indexable.
Also, following up on my previous question that wasn’t answered yet.
1. I have added the code, however, it shows that spam page is still indexable. (refer to image 4 in the sensitve data). Those pages are somewhat live on the sites. I am not sure how we can have those spam pages removed. This is because those pages are not marked as suspicious files.
2. I read this post that you shared (https://rankmath.com/kb/internal-site-search-spam/). Previously before this spam happened, I already have this part “Rank Math > Tiles & Meta > Misc Pages from your WordPress dashboard” and Enable the Noindex Search Results option” previously.
However, in my search console data, i saw 14k pages are “excluded by ‘no index tag’.
But 2k of these pages managed to be “crawled – currently not indexed). Why is that 2k spam pages like these not being marked as no-index by Rank Math’s settings “Enable the Noindex Search Results option””
So how do we settled or removed these 2k plus pages crawled but not indexed? Can you suggest what’s the best way to go about this?
Hello,
I have replied to your other ticket about feed URLs and internal site search spam here: https://support.rankmath.com/ticket/internal-site-search-spam/?view=all#post-549867
Please refer to other ticket and post your comments and questions there.
Thank you.
The ticket ‘bunch of weird URLs under ‘noindex’ tag.’ is closed to new replies.