Project:LHS Graphs and Visualizations

From London Hackspace Wiki
Revision as of 19:23, 14 December 2010 by 195.24.233.122 (talk)


Members: Elliot

Overview

I'd like to supplement the Cacti graphs that we have for LHS bandwidth and power with metrics that give insight into the growth of our community and organisation over time.

Initially I'd like to chart the following:

  • Number of members
  • Wiki activity

Later I'd like to investigate:

  • Website visitors and/or page impressions
  • Mailing list activity
  • Space occupancy

Phase 1

To chart the initial metrics, the underlying data needs to be exposed in a Cacti-friendly way. I need help getting access to these data sources so that I can write the data input methods.

Number of members

This data is stored in an SQLite database on Turing. See the Schema. It can be queried like so:

 SELECT COUNT(id)
 FROM users
 WHERE subscribed = 1;

And for pending users:

 SELECT COUNT(id)
 FROM users
 WHERE subscribed = 0;

Note: Cacti appears to run on babbage, so we need a PHP script on Turing to expose the member numbers and a script on babbage to fetch them over HTTP.

This PHP script should generate the following output for Cacti: subscribed:136 pending:2

 <?php
 
 require_once( $_SERVER['DOCUMENT_ROOT'] . '/../lib/init.php');
 
 // Pending members are taken to be rows with subscribed=0, matching the SQL above.
 $subscribers = $db->translatedQuery( 'SELECT COUNT(id) AS num FROM users WHERE subscribed=1' )->fetchRow();
 $pending = $db->translatedQuery( 'SELECT COUNT(id) AS num FROM users WHERE subscribed=0' )->fetchRow();
 
 // Match the expected output format exactly: no space after the colon.
 print "subscribed:{$subscribers['num']} pending:{$pending['num']}";

Perl to fetch member numbers for Cacti:

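 #!/usr/bin/perl
 
 use strict;
 use warnings;
 use LWP::Simple;
 
 eval {
   my $file = get("http://london.hackspace.org.uk/member_stats.php");
   # get() returns undef on failure instead of dying, so fail explicitly.
   die "request failed\n" unless defined $file;
   print $file;
   1;
 } or do {
   print "NaN";
 };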

Wiki statistics

  • We can get this using the MediaWiki API:
 http://wiki.hackspace.org.uk/w/api.php?action=query&meta=siteinfo&siprop=statistics&format=xml

It returns:

 <?xml version="1.0"?>
 <api>
   <query>
     <statistics
       pages="759"
       articles="215"
       views="229656"
       edits="7368"
       images="186"
       users="166"
       activeusers="22"
       admins="61"
       jobs="13"
     />
   </query>
 </api>

Perl to generate the following output for Cacti: pages:759 articles:215 views:229656 edits:7368 images:186 users:166 activeusers:22 admins:61 jobs:13

 #!/usr/bin/perl
 
 use strict;
 use warnings;
 use LWP::Simple;
 use XML::Simple;
 
 eval {
   my $file = get("http://wiki.hackspace.org.uk/w/api.php?action=query&meta=siteinfo&siprop=statistics&format=xml");
   # get() returns undef on failure instead of dying, so fail explicitly.
   die "request failed\n" unless defined $file;
 
   my $doc = XML::Simple->new()->XMLin($file);
   my $stats = $doc->{query}->{statistics};
 
   # Build the whole line first so a failure never leaves partial output.
   print join(" ", map { $_ . ":" . $stats->{$_} } keys %{$stats});
   1;
 } or do {
   print "NaN";
 };
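
For comparison, the same transformation can be tried offline against the sample response above. This is only a sketch in Python using the standard library; the function name `cacti_line` and the embedded `SAMPLE_RESPONSE` are made up for illustration, and it is not the script that Cacti would actually run.

```python
import xml.etree.ElementTree as ET

# The sample API response from above, embedded so the sketch runs offline.
SAMPLE_RESPONSE = """<?xml version="1.0"?>
<api>
  <query>
    <statistics pages="759" articles="215" views="229656" edits="7368"
                images="186" users="166" activeusers="22" admins="61" jobs="13" />
  </query>
</api>"""


def cacti_line(xml_text):
    """Turn the <statistics> attributes into space-separated name:value pairs."""
    try:
        stats = ET.fromstring(xml_text).find("./query/statistics")
        if stats is None:
            raise ValueError("no <statistics> element in response")
        return " ".join(f"{name}:{value}" for name, value in stats.attrib.items())
    except (ET.ParseError, ValueError):
        # Mirror the Perl script's fallback when the response is unusable.
        return "NaN"


print(cacti_line(SAMPLE_RESPONSE))
```

Feeding it something that is not XML, or XML without a statistics element, yields NaN rather than a partial line.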

Phase 2

Mailing list activity

  • There is no API for Google Groups. Perhaps poll the RSS feed and count message IDs not seen on a previous poll, or compare item dates against the time of the last poll?
 https://groups.google.com/group/london-hack-space/feed/rss_v2_0_msgs.xml?num=50
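
The "count unrecognized message IDs" idea could look roughly like the sketch below. It assumes the feed is ordinary RSS 2.0 with a guid (or at least a link) per item, which has not been checked against the real Google Groups feed. The function name `count_new_messages` and the sample feed are hypothetical, and storing the seen IDs between polls is left out.

```python
import xml.etree.ElementTree as ET


def count_new_messages(rss_text, seen_ids):
    """Count feed items whose ID is not in seen_ids, and record them as seen."""
    new = 0
    for item in ET.fromstring(rss_text).iter("item"):
        # Prefer <guid>; fall back to <link> if the feed omits it (assumption).
        msg_id = item.findtext("guid") or item.findtext("link")
        if msg_id and msg_id not in seen_ids:
            seen_ids.add(msg_id)
            new += 1
    return new


# Tiny hand-written feed standing in for the real one.
SAMPLE_FEED = """<rss version="2.0"><channel>
  <item><title>Hello</title><guid>msg-1</guid></item>
  <item><title>Re: Hello</title><guid>msg-2</guid></item>
</channel></rss>"""

seen = set()
print(count_new_messages(SAMPLE_FEED, seen))  # first poll: both items are new
print(count_new_messages(SAMPLE_FEED, seen))  # second poll: nothing new
```

Because the feed URL asks for only 50 messages, a poller like this would presumably undercount if more than 50 messages arrive between polls.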

Website visitors and/or page impressions

The main site and the wiki use Google Analytics, which has an API. This documented query looks promising:

 https://www.google.com/analytics/feeds/data?metrics=ga%3Avisits%2Cga%3Apageviews&start-date=2010-11-29&end-date=2010-12-13&max-results=50
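
Since the date range has to change on every poll, the query string would need to be built rather than hard-coded. A minimal sketch, assuming only the parameters visible in the URL above; the helper name `build_query_url` is made up, and authentication and any other parameters the API may require are not shown.

```python
from urllib.parse import urlencode

BASE_URL = "https://www.google.com/analytics/feeds/data"


def build_query_url(start_date, end_date,
                    metrics=("ga:visits", "ga:pageviews"), max_results=50):
    """Build the data feed URL for a date range given as YYYY-MM-DD strings."""
    params = {
        "metrics": ",".join(metrics),  # urlencode escapes the ':' and ','
        "start-date": start_date,
        "end-date": end_date,
        "max-results": max_results,
    }
    return f"{BASE_URL}?{urlencode(params)}"


print(build_query_url("2010-11-29", "2010-12-13"))
```

Called with the dates above, this reproduces the example URL character for character.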

Phase 3

Space occupancy

  • We could use a directional IR occupancy counter. I think that we already have something like this in the LHS stores.
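
How a directional counter might turn beam breaks into an occupancy figure is sketched below. It assumes two IR beams across the doorway, an outer one ("A") and an inner one ("B"), so that A followed by B is an entry and B followed by A is an exit. The hardware in the stores may work differently, and the function name `occupancy_from_events` is hypothetical.

```python
def occupancy_from_events(events, start=0):
    """Fold beam-break events ("A" = outer beam, "B" = inner beam) into a count."""
    count = start
    pending = None  # first beam of a crossing that is still in progress
    for beam in events:
        if pending is None:
            pending = beam
        elif beam != pending:
            # A then B is an entry, B then A is an exit; never drop below zero.
            delta = 1 if pending == "A" else -1
            count = max(0, count + delta)
            pending = None
        # The same beam twice in a row is treated as someone hesitating in the
        # doorway, so we keep waiting for the other beam.
    return count


# Two people enter, then one leaves.
print(occupancy_from_events(["A", "B", "A", "B", "B", "A"]))
```

The resulting number could then be exposed to Cacti in the same name:value form as the other metrics.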