Sitemap pages return 404 and pages not indexed

#850790
  • Resolved Sameer Ambhore
    Rank Math free

    Hello,

    Bhanu asked me to post my request here.

    None of my pages on https://planet11plus.co.uk are indexed on google – I have requested for a few weeks now.

    The sitemap pages keep breaking resulting in 404 errors almost every day making me reset the Rank Math configuration and saving again after which they become accessible again.

    I have checked that robots.txt has no blocks, the sitemap is correct and the live pages can be successfully tested on Google search centre.
    When testing Live URL I get a conflicting message saying URL is on google.

    I’m stuck now as I have no idea what I should do as there are no errors or anything that I can work on to get this corrected.

    1. Validated sitemap (https://planet11plus.co.uk/sitemap_index.xml). No issues detected but it keeps coming with 404 errors after every 1-2 days.
    2. The Index check box is enabled/checked on the Robots Meta section of Title and Meta.

    Appreciate your support.

    Many Thanks,
    Sameer

Viewing 15 replies - 1 through 15 (of 20 total)
  • Hello,

    We apologize for the issues you are facing with your sitemap and indexing on Google. Thank you for providing detailed information about your situation.

    Here are some steps you can take to troubleshoot and potentially resolve the 404 errors and indexing issues:

    1. Flush Permalinks

    Sometimes, flushing the permalinks can resolve 404 errors. To do this:

    • Go to Settings → Permalinks in your WordPress admin area.
    • Click the Save Changes button without making any changes.

    Flush permalink settings

    2. Check for Plugin Conflicts

    Since you mentioned that the sitemap keeps breaking, it might be due to a plugin conflict. We recommend using the Health Check plugin to identify any conflicts with other plugins.

    3. Ensure Posts Are Set to Index

    You mentioned that the Index checkbox is enabled, but please double-check that all your posts are set to index. You can do this by going to Rank Math SEO → Titles & Meta and ensuring that the Index checkbox is checked for all relevant post types.

    4. Monitor Server Configuration

    If you are using Nginx or Apache, ensure that your server configuration is correctly set up to handle the sitemap requests. Sometimes, server settings can inadvertently block access to the sitemap.

    5. Check for Caching Issues

    If you have caching enabled (either through a plugin or server-side), try clearing the cache. Caching can sometimes serve outdated versions of your sitemap, leading to 404 errors.

    6. Review Robots.txt File Again

    Even though you mentioned there are no blocks in your robots.txt file, it’s worth reviewing it again to ensure there are no unintended rules that could affect indexing.

    7. Google Search Console

    Since you are receiving conflicting messages in Google Search Console, ensure that you have submitted the correct sitemap URL and monitor any messages or warnings provided by Google regarding indexing issues.

    If these steps do not resolve the issue, please reply to this ticket, and a human agent from the Rank Math Support team will take over and assist you further.

    We appreciate your patience and understanding as we work to resolve this issue for you!

    Sources:
    Troubleshooting the “Couldn’t Fetch” Error in Google Search Console » Rank Math
    Rank Math Sitemap Issues and Their Fixes » Rank Math

    1. Permalinks flushed.
    2. No plugin conflicts detected.
    3. Ensured that posts are set to index.
    4. Server settings doesnt seem to be an issue.
    5. I have flushed the cache from the server side.

    6. robots.txt is the same from earlier (when I used Yoast SEO) and there are no blocks:
    User-agent: *
    Disallow:
    Sitemap: https://planet11plus.co.uk/page-sitemap.xml
    Sitemap: https://planet11plus.co.uk/sitemap_index.xml

    7. No errors from google search console, except one from the past that mentions “No referring sitemaps detected”. However the referred page is an old wordpress post which makes no sense.

    Hello,

    Thank you for your query and we are so sorry about the trouble this must have caused.

    We checked your sitemap and it seems to be working just fine.

    To ensure that your sitemap is free from caching, make sure to follow this guide as well: https://rankmath.com/kb/exclude-sitemaps-from-caching/

    As for the pages that are not indexed, please share some example URLs here so we can take a closer look.

    Also, please submit the affected URL/s to your Google Search Console’s URL Inspection tool and share a full screenshot of the report.

    Looking forward to helping you.

    Sameer Ambhore
    Rank Math free

    Hello Jeremy,

    Thanks for your message.
    None of the pages are indexed: For example: https://planet11plus.co.uk OR https://planet11plus.co.uk/subjects

    I have uploaded PDF with the screenshots from the Google Search COnsole here:
    https://planet11plus.co.uk/wp-content/uploads/2024/08/Google-Search-Console-Screenshots-2.pdf.

    Please let me know if you need more information.

    Many Thanks,
    Sameer

    Hello,

    Please update your robots.txt to our default and recommended rules for indexing:
    https://rankmath.com/kb/how-to-edit-robots-txt-with-rank-math/#default-rules

    Also, you should remove the page-sitemap.xml from your GSC since you already submitted your sitemap index.

    You can remove all of them and resubmit the /sitemap_index.xml only. Here’s our guide you can follow:
    https://rankmath.com/kb/submit-sitemap-to-google/

    Once done, give Google some time to crawl your URLs.

    Looking forward to helping you.

    Sameer Ambhore
    Rank Math free

    Hi Reinelle,

    Thanks for your message.

    As advised:
    I have changed robots.txt to RankMath default: https://planet11plus.co.uk/robots.txt

    I have removed all sitemaps and added /sitemap_index.xml again. However, the total number of discovered pages for this sitemap is showing zero, as shown in the screenshots sent in my last email. Is that okay?

    Many Thanks for your support,
    Sameer

    Hello,

    Please note that each time Google uses a sitemap to find a URL, the count of the Discovered URLs increases by one. Sitemaps is only one of the few methods Google uses to discover your URLs.

    When the count of the Discovered URLs is zero or does not match your actual sitemap in your Google Search Console, it means Google didn’t use the sitemap to find the URLs.

    It mostly happens when you have good internal linking or use the Instant Indexing plugin.

    Here’s a link for more information:
    https://rankmath.com/kb/zero-discovered-urls-through-sitemap/

    Looking forward to helping you.

    Sameer Ambhore
    Rank Math free

    Hi Adetayo,

    Thanks for your reply.
    Earlier I used two sitemaps /sitemap_index.xml (0 discovered pages) and /page-sitemap.xml (71 discovered pages).
    As recommended I’m now using only /sitemap_index.xml (0 discovered pages) which I resubmitted after removing all sitemaps.
    I understand that google uses other methods to discover pages; does it mean the submitted sitemap has no relevance?
    I’m getting anxious as nothing seems to be working and it’s not clear how long it will take for google to finally index the pages.

    Many thanks for all your support,
    Sameer

    Hello,

    To be sure that Google is crawling the website you can head over to Google Search Console and in the settings select the crawling report which can be found like so:
    Report

    If you see stats for crawling it means that Google is seeing your website which is a good thing. If Google is seeing the website but not indexing, you need to head over to the indexing report on GSC and see if you are seeing any error preventing indexing mentioned there.

    At the moment we can see that the sitemaps and the website are accessible to Googlebot so this could be just a matter of time or a content issue that prevents Google from indexing the website.

    Don’t hesitate to get in touch if you have any other questions.

    Sameer Ambhore
    Rank Math free

    Hello Miguel,

    Thanks for your reply.

    When you say “content issue”, what exactly does it mean? I do not see any errors related to content, could I be missing anything obvious?

    As you pointed out, I can see Google is able to crawl the pages in crawl stats. But is has been like this for days/weeks and I do not see any errors that I can work on 🙁

    Many Thanks,
    Sameer

    Hello,

    Google uses its algorithm to determine if a webpage is suitable for its SERPs. When we refer to “content issue” we mean there’s also the possibility that the content of your site’s pages can affect its indexing. Some of these issues that can hinder indexing are when the content is a duplicate of some other webpage on the internet, the quality is poor, the content is too thin or it is irrelevant to users’ search intent/query.

    To increase the chances of Google indexing your site’s pages ensure that your content is unique, valuable, and relevant.

    Given that Google can crawl your pages, it’s a positive sign. However, indexing can sometimes take a bit longer depending on various factors, including the points above.
    Here’s what you can do:

    – Use the URL Inspection tool in Google Search Console to manually request indexing of your main pages.

    – Double-check your content quality and ensure it’s free from issues that could affect indexing.

    Please keep in mind that indexing can take time, and being patient is crucial.

    We hope this helps.

    Thanks.

    Sameer Ambhore
    Rank Math free

    Hi,

    My sitemap URLs are showing 404 errors again. Why would this happen frequently? Even my robots.txt is not the one after making changes last week.
    I had changed my robots.txt to Rank Math default as advised but it is now revrted back to the old robots.txt again.

    What could be causing these discrepancies? I have already insured there are no conflicts or caching issues.

    Rgds,
    Sameer

    Hello,

    While you’ve mentioned there are no conflicts, sometimes certain plugins or themes can cause these kinds of issues by resetting configurations.

    Try temporarily disabling all other plugins except Rank Math to see if the issue persists. If the problem goes away, re-enable the plugins one by one to identify the cause.

    If the problem continues, it might be helpful to reach out to your hosting provider to ensure there are no automated scripts or settings that are causing these reverts.

    Also, even though you’ve cleared the cache, there might be deeper caching mechanisms in place (like server-level caching or CDN caching) that are causing old versions of files to be served.

    We hope this helps.

    Thanks.

    Sameer Ambhore
    Rank Math free

    Disabling all plugins is not possible as this problem is intermittent and I see 404 errors after every few days.
    I see conflicting messages from google search console but the attached error has been there for a long time now.
    null
    Any help will be really appreciated.
    Rgds,
    Sameer

    Hello,

    The “no referring sitemap detected” means the page was discovered by Google just not through the sitemap.

    Please note that a sitemap is only one of many ways Google discovers the URLs of your site. If Google discovers the URL through other means, it may not cross-reference your sitemap to check if the URL is there. Hence the “No referring sitemaps” message.

    We shared more about this here: https://rankmath.com/kb/no-referring-sitemaps-detected/

    You can also find information about the “Crawled – Currently Not Indexed” error here: https://rankmath.com/kb/crawled-currently-not-indexed/

    We hope this helps.

    Thanks.

    Hello,

    Since we did not hear back from you for 15 days, we are assuming that you found the solution. We are closing this support ticket.

    If you still need assistance or any other help, please feel free to open a new support ticket, and we will be more than happy to assist.

    Thank you.

Viewing 15 replies - 1 through 15 (of 20 total)

The ticket ‘Sitemap pages return 404 and pages not indexed’ is closed to new replies.