GitXplorerGitXplorer
e

GWT_IndexStatus-php

public
6 stars
7 forks
0 issues

Commits

List of commits on branch master.
Unverified
6a7aead14a93f88267bcffc4db59b1ea17ce6cbc

Don't interpret leading '@'s in POST values.

eeyecatchup committed 11 years ago
Unverified
0361e3fb539a37b359ac94ac316c01c80545e29d

Don't interpret leading '@'s in POST values.

eeyecatchup committed 11 years ago
Unverified
1d550002cebb2f4778e1e58f8ba626e881750b8c

Merge pull request #2 from pkollitsch/master

eeyecatchup committed 11 years ago
Unverified
a19e7aea72dc9c2e3feba1bb9b8a45d301fbb9e3

Fix for #1 at original repo: Request parameters changed

ddavidsneighbour committed 11 years ago
Unverified
c533de61e560e6a08d0ee4e02a6db9f52a862478

Added two-step-verification note to README file.

committed 12 years ago
Unverified
8a88d2a3f16e7d850cdf1426dbc1bc2cd4256ffd

Added two-step-verification note to README file.

committed 12 years ago

README

The README file for this repository.

GWT_IndexStatus: Request Index Status data from Google Webmaster Tools.

Introduction

Since Googlebot was born, webmasters around the world have been asking one question: Google, oh, Google, are my pages in the index? In July 2012 Google answered that question with a new feature in Webmaster Tools called 'Index Status'. Further information can be found here: http://googlewebmastercentral.blogspot.de/2012/07/behold-google-index-secrets-revealed.html.

This PHP class lets you request that information script-wise and returns the 'Index Status'-data in JSON format.
The package also includes an example how to automatically download the data into CSV files.

Download

The latest stable version can be downloaded from the downloads tab, or using the following link:
https://github.com/downloads/eyecatchup/GWT_IndexStatus-php/GWT_IndexStatus-php-1.0.0.zip

Usage - Get Started

To get started, the steps are as follows:

  • Download the Zip file (click here) and unzip it.
  • Create a folder and add the GWT_IndexStatus.php script from the extracted Zip to it.

Authentication

To request data from your Google Webmaster Tools Account you must authenticate requests using your Google Account login credentials. There are two ways to do so.

Authentication via Interface

The easiest way is to set your login credentials using the class interface GWT_Client. To do so, open the GWT_IndexStatus.php file, go to line 53 and enter your login credentials.

<?php
	interface GWT_Client
	{
		const Email = 'username@gmail.com';
		const Passwd = 'secretpassword';
	}

(Explicit) Authentication via login()

Alternatively, you can use the login() method to authenticate your requests. This is also handy if you want to request data for another account than defined in the interface. For an example, please see the php file example4.php.

NOTE: The script will NOT work, if you have two-step-verification enabled. (If you don't know what that is, just forget about this note. ;-))

Usage - Examples

Example 1 - getDataByDomain()

To download index status data for a single domain, the steps are as follows:

  • In the same folder where you added the GWT_IndexStatus.php, create and run the following PHP script.
    Note: You'll need to authenticate your request (see above)! Also, you'll need to replace the example URL with a domain that is registered in your GWT account.
<?php
require_once('GWT_IndexStatus.php');

try {
  $indx = new GWT_IndexStatus();

  $json = $indx->getDataByDomain('http://www.domain.tld/'); # NOTE: Must have a trailing slash!
  print_r( json_decode($json) );
}
catch (Exception $e) {
  die( $e->getMessage() );
}

By default, the class will return totals of indexed pages, the cumulative number of pages crawled, the number of pages that Google knows about which are not crawled because they are blocked by robots.txt, and also the number of pages that were not selected for inclusion in Google's search results. However you can adjust the request parameters using the setParam() method (as done in the 2nd example).

Example 2 - getDataAllDomains()

To download index status data for all domains that are registered in your Webmaster Tools account, the steps are as follows:

  • In the same folder where you added the GWT_IndexStatus.php, create and run the following PHP script.
    Note: You'll need to authenticate your request (see above)!
<?php
require_once('GWT_IndexStatus.php');

try {
  $indx = new GWT_IndexStatus();
  
  /**
   * Exclude 'blocked by robots.txt'-data.
   * Valid parameters are 'is-crawl' (Ever crawled), 'is-indx' (Total indexed), 
   * 'is-rm' (Removed) and 'is-rbt' (Blocked by robots).
   */
  $indx->setParam('is-rbt', 0); 

  $json = $indx->getDataAllDomains();
  print_r( json_decode($json) );
}
catch (Exception $e) {
  die( $e->getMessage() );
}

Example 3 - Save data as CSV

To download index status data for all domains that are registered in your Webmaster Tools account and save the resultsets into CSV files, the steps are as follows:

  • In the same folder where you added the GWT_IndexStatus.php, create and run the following PHP script.
    Note: You'll need to authenticate your request (see above)!
<?php
require_once('GWT_IndexStatus.php');

try {
  $indx = new GWT_IndexStatus();

  $json = $indx->getDataAllDomains();
  $domains = json_decode($json);

  foreach ($domains as $domain) {
    $data = $domain->indexData;
    $colCount = sizeof($data->cols);

    // Get column label names for CSV headline
    $labels = Array();
    for ($i=0; $i < $colCount; $i++) {
        $labels[] = $data->cols[$i]->label;
    }
    $cols = implode(';', $labels);

    // Start new CSV output string.
    $csv = "$cols\n";

    // Iterate through all resultsets and add rows to CSV output string
    // containing the columns as defined per instance.
    foreach ($data->rows as $r) {
      $row = Array();
      for ($i=0; $i < $colCount; $i++) {
          $row[] = $r->c[$i]->v;
      }
      $csv .= implode(';', $row) ."\n";
    }

    // Write CSV string to file.
    $filename = 'GWT_IndexStatus_' .
      parse_url($domain->domain, PHP_URL_HOST) .'_'.
      date("m-d-Y") .'.csv';

    if (file_put_contents($filename, $csv))
      print "Saved $filename\n";
  }
}
catch (Exception $e) {
  die( $e->getMessage() );
}

This example code will create a CSV file for each domain that is reqistered in your GWT account in the same folder where you run the script.

Legal Note

The author of the software is not a partner, affiliate, or licensee of Google Inc. or its employees, nor is the software in any other way formally associated with or legitimized by Google Inc.. Google is a registered trademark of Google Inc.. Use of the trademark is subject to Google Permissions: http://www.google.com/permissions/index.html.

Copyright / License

URL: https://github.com/eyecatchup/GWT_IndexStatus-php/
License: http://eyecatchup.mit-license.org/
(c) 2012, Stephan Schmitz eyecatchup@gmail.com