Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Analyzing some data of each URL of my website

With Google Search Console API and Apache logs

         

guarriman3

3:02 pm on Jan 23, 2024 (gmt 0)

10+ Year Member Top Contributors Of The Month



I'm trying to automate the analysis of the URLs of my website, in order to detect crawling errors or trigger some alarms.

I would loop all the URLs of my database, and fill the data of some fields. I'm thinking in the following fields for each URL, getting the information from the Apache logs and/or the Google Search Console API (GSC-API).

Fields for the URLs:
  • Ok in the Sitemaps: GSC-API?
  • Last crawling date: Apache logs + GSC-API?
  • Indexed in Google: GSC-API
  • Non-indexed status (redirect, noindex, crawled not indexed, discovered not indexed, soft 404...): GSC-API?
  • Visits from Google in a specific time period: Apache logs + GSC-API?

    My main problem is to know the capabilities of GSC-API, since I've just started to test it. Do you know if it's possible to get such information (last crawling date, non-indexed status, visits from Google) from GSC-API?
  • phranque

    8:27 pm on Jan 23, 2024 (gmt 0)

    WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month


    in general, yes.

    you should examine the various methods and objects described in the [url=https://developers.google.com/webmaster-tools/v1/api_reference_index]Google Search Console API Reference[/url]

    for example this is the [url=https://developers.google.com/webmaster-tools/v1/urlInspection.index/UrlInspectionResult]
    UrlInspectionResult object description[/url]