Confessions of a Compulsive Coder: 2011

Tuesday, December 6, 2011

Guernsey - a REST client for Python

Having written a number of REST clients in Python over the last few years, they all start out the same way; write a simple piece of code to call one service, then realize that you have to call a second service, then a third and so the code becomes more and more generic. However, starting the next project I have to untangle that code from project A and copy into project B - if possible, changing an employer does make code reuse a pain.

So this time, when the need arose I decided to write the client as a separate and reusable package - Guernsey. I then looked at the API I had developed before and it really wasn't very good, or rather it still showed aspects of the first project I created it for. Right now my day job also includes writing Java REST services and we use JAX-RS which give us a really annotation based model. As an implementation we use the Sun Jersey implementation which also has a very nice client package, so I set out to see if I could use that design as a basis for a Python package.

The result is pretty close to the Jersey API, which may be of value to anyone working in Java and Python (which I do regularly) but also it simply benefits from the effort someone else put into the API design. I also took care with naming to keep things as close as possible, but to use a Python style (lowerCamelCase became lower_camel_case, some getter/setters became simple object properties, etc.).

The following example shows a simple client that accesses a public REST service.

from guernsey import Client

client = Client.create()
resource = client.resource('http://www.thomas-bayer.com/sqlrest/')
resource.accept('*/xml')
namespaces = resource.get()

The Client class is the entry point for most common cases, so for most uses you can simply import the one class.
Calling Client.create will construct a new Client object, this method takes a dictionary that provides configuration parameters, or without configuration will create a new object with all default values.
Calling client.resource will construct a new WebResource object which is the actual object you perform REST methods against.
The WebResource object has a number of methods which can be used to configure the resource behavior; in this example we simply set the accept headers to denote that we wish to be returned XML data if possible.
Finally we call the get method on the WebResource (each WebResource supports get, head, put, post, delete and options by default but could be extended to support other methods).

All the configuration methods on the return WebResource the object back to the caller which allows configuration methods to be chained together which can make some code more readable.

namespaces = client.resource('http://www.thomas-bayer.com/sqlrest/') \
                   .accept('*/xml', 0.8) \
                   .accept('*/json', 0.2) \
                   .accept_language('en-US') \
                   .get()

In this example we can see that the setup methods accept and accept_language are all chained together and finally the get method is called. A number of simple methods are provided to manipulate common headers, although if you want to add your own header you can use the add_header method.

Navigating Resources

Where your REST service follows a true hypertext (HATEOS) style one would expect that navigating between resources should be simple as any resource would provide within it a navigable URL from it to any related resource. However, some services still do not provide a true hypertext style, either returning relative URLs or worse returning identifiers which a client has to use to construct a URL.

Guernsey should help you in all three cases:

namespaces = resource.get()
data = namespaces.parsed_entity
# Hypertext link:
new_resource = client.resource(data['resource_url'])
# Relative path:
new_resource = resource.path(data['resource_path'])
# A sub-resource identifier:
new_resource = resource.sub_resource(data['resource_id'])
# Just an identifier:
new_resource = client.resource('http://example.com/{resource_id}', data)

In the first case we have, in our parsed response data, a full URL and so we can easily call the client.resource method to construct a new WebResource.
In the second case we have a new path relative to the path of the current resource, in this case we can use the resource.path method which will construct a new absolute URL by resolving the provided path against the current URL.
In the third case we only have, in our parsed response data, an identifier which corresponds to a path segment that can be appended to the current resource path (commonly termed a sub-resource identifier).
In the last case we need to construct an entirely new resource, but using the identifier in the parsed response data, in this case we use a templated resource URL in the call to client.resource.

Handling Entities

In the section above we mentioned the use of a parsed entity, the Guernsey client has the ability to detect a known resource representation and then convert from its serialized form returned from the REST service into a common Python form. In all cases any entity returned from the service will be put into the entity property of the response, if the Content-Type header in the response identifies a type that can be de-serialized then the response will contain a parsed_entity property.

namespaces = resource.get()
raw_data = namespaces.entity
xml_data = namespaces.parsed_entity

The client performs this by using the EntityReader and EntityWriter classes in the guernsey.entities module. These classes define a simple interface for a subclass to determine whether a given type can be serialized, and if so to convert it to/from a reasonable Python form. Instances of these classes are added to the array property entity_classes on the Client object and will be called in order to process any request and response.

The guernsey.entities module contains reader/writer classes for XML and for JSON and these are added by default to the standard Client. The JSON reader will convert serialized JSON into Python dictionaries and standard types using the standard json module, the writer will serialize Python dictionaries and standard types into the JSON serialized form. The XML reader and writer convert between serialized XML and ElementTree objects from the xml.etree.ElementTree standard module.

Additional entity readers and writers can easily be constructed, and if added to the client will be used by any resource created by the client.

Client Filters

An additional capability of Jersey, and therefore Guernsey, is the ability to write client-side request/response filters. These filters work much like Servlet filters in Java or WSGI middleware in Python in that they are chained together, the handler is given the current request and is able to modify the request before passing it to the next handler in the chain. The last handler actually performs the request, constructs a response object and returns it back down the chain. Each handler is then able to modify the response before returning it itself.

A number of common filters are provided to enable logging, Gzip encoding and MD5 content hashes. These can be added simply by using the add_filter method on a resource.

resource = client.resource('http://www.thomas-bayer.com/sqlrest/')
namespaces.add_filter(LoggingFilter('TestFilterLogging'))
response = namespaces.accept('*/xml').get()

Each handler will have a handle method which will be given a client request and be expected to return a client response. Each handler will also be responsible for calling client_request.next_filter to ensure that all filters are chained correctly. For example, the following is the implementation of the handler method in the logging filter shown above.

def handle(self, client_request):
    logger = logging.getLogger(self.log_name)
    try:
        logger.info(client_request.method + ' ' + client_request.url)
    except:
        print 'Error writing request to log'
    client_response = client_request.next_filter(self).handle(client_request)
    try:
        logger.info(client_response.url + ' ' + 
            str(client_response.status)+ ' ' + 
            client_response.reason_phrase)
    except:
        print 'Error writing response to log'
    return client_response

Implementation

I chose to use urllib2, its low level, has a lot of features although not as many as Joe Gregorio's httplib2, but have also a lot of experience using it. I may well change to using httplib2 in the future and there's no reason the Guernsey API should change whichever underlying module I use.

On the name, if you haven't figured it out yet, both Jersey and Guernsey are part of the Channel Islands.

Thursday, November 17, 2011

Minifying CSS - inline images

Having worked on a pretty complex web application, a lot of JavaScript with Dojo, we had to tackle the common problem of browser load performance. Dojo provides some pretty good build tools to minify the JavaScript used by your application, and does compress all the CSS in your application directories as well. The issue is that our CSS references a lot of images, either background, buttons, or other images used in widgets.

So, one possibility is to actually inline the images themselves into your CSS, and with compression and concatenation of the CSS you get to one nice big resource rather than a whole bunch of related and smaller resources.

The trick is to identify all the CSS rules that have values of the form url("...") or url('...'), which are relative file locations. Now, load that resource, convert it into a base-64 encoded value and write this back out as a value of the form:

url('data:image/png;base64,...encoded-value...')

We use the simple Python script below to process a single CSS file, then a shell script that finds and applies this to all our CSS files before we run the standard Dojo build process.

#!/usr/bin/env python

import base64, logging, os, os.path, re, sys

log = None

RE_URL = re.compile("url\([\"'](?P[^\"']+)[\"']")

def init():
    global log
    logging.basicConfig(level=logging.DEBUG)
    log = logging.getLogger('cssmin')

def parse_command_line():
    " returns (options, args) "
    global log
    if len(sys.argv) == 0:
        print "Must specify a CSS file"
        sys.exit(1)
    return ({}, sys.argv[1:])

def encode(file):
    input = open(file)
    encoded = base64.b64encode(input.read())
    input.close()
    return encoded

def process_css(css_file):
    original_file = "%s.original" % css_file
    os.rename(css_file, original_file )
    log.info("Reading file %s." % original_file)
    input = open(original_file, "rt")
    log.info("Writing file %s." % css_file)
    output = open(css_file, "wt")
    for raw_line in input:
        match = RE_URL.search(raw_line)
        if match:
            url = match.group("url")
            src = os.path.join(os.path.split(original_file)[0], url)
            type = os.path.splitext(src)[1]
            if type[1:].lower() in ["gif", "png", "jpeg", "jpg"]:
                data = encode(src)
                output.write(raw_line[:match.start("url")])
                output.write(
                    "data:image/%s;base64,%s" %
                    (type,data)
                    )
                output.write(raw_line[match.end("url"):])
                log.info("inlined resource %s." % src)
            else:
                log.info("ignoring url for type %s." % type)
                output.write(raw_line)
        else:
            output.write(raw_line)

if __name__ == '__main__':
    init()
    (options, args) = parse_command_line()
    process_css(args[0])

Tuesday, October 18, 2011

An Apple users first week with Android

I recently gave up on my aging iPhone 3G, the GPS hardware had failed, the upshot of which was that if I turned on GPS in any app the phone slowed and locked up. The lack of software updates for the older iOS on the 3G meant that worked turned off access to the corporate email because Apple even stopped issuing security patches for older devices. Finally, battery life, even without corporate email, was so poor I could easily get caught out without a usable phone. Also, and most disturbing, my 16Gb phone was full, I'd used it all up on music, I'd stopped synching TV and movie files, then eBooks, but eventually it stopped synching all together.

So, I waited, and waited, would the rumored new iPhone 5 be everything I wanted? Could I wait that long (this was back in August)? Also Amazon announced the Kindle Fire, a simple but nice Android tablet with a great price, I knew I wanted an Android tablet, even though as an Apple household we had an iPad. I really never objected to the Apple lock-down of my phone, after all I always needed it to be able to take a call and didn't want rogue apps turning it into a brick, but with the table, why can't I develop on it? why am I restricted from running an interpreter? I know Apple did lighten up on some of that, and I did run scheme on the iPad which was kind of fun but I coveted the tablets I could shell into and run Python. So, the Fire is definitely on my wish list.

So, October arrives and the iPhone 5 became the iPhone 4S, I've seen colleagues with different Android phones, Droids, Atrias, Galaxys, but certainly the Galaxy S II seemed to have both traction and really looked nice. So, when my 3G finally because just too much of a pain, I bought a Samsung Galaxy S II to replace it, upgrading my iPhone on my AT&T plan which was a bit of a circus but eventually managed to keep my unlimited data.

So, Thursday, in a meeting a few of us were talking phones and when I said I'd gone from iPhone to Android (or should I say iOS to Android, Apple to Samsung, or iPhone to Galaxy S II?) I got the response "but isn't that a downgrade?". Which caused me to think, what have I given up, what isn't so nice about the Samsung phone or Android OS? The fact I even asked the question implies that I assume that I've given something up, right?

So, after a full week with the phone, and I use my phone a lot so its a short enough time to remember first impressions but long enough to have made some lasting ones too, I wanted to muse on that thought for a while.

Setting up the phone was easy, there were no manuals apart from the usual fold-out getting started guide, but I was able to figure out the general operation and the Androidy way of doing things such as the fixed navigation keys etc. I love the screen on the Galaxy (I'll drop the S II now), and while I know the iPhone 4/4S has a great screen the brightness, depth of color and size (4.3") is really nice. The phone is fast, I mean really fast and with the new AT&T network it browses fast and downloads fast and while its not so fast at home in our cell-challenged area, it actually works which was always at best a 50/50 proposition with the iPhone. So, in initial impressions, did I lose anything? well I guess I lost rounded buttons, the Android buttons are very square, there's no alpha-blended lighting effects either, but funnily enough the buttons work just fine without that - wow, who'd have known?

Also, in terms of giving things up, I knew I needed to get my music back on my device, or did I? Along with the iPhone 4S launch we heard about iCloud, but to be honest Apple is so late to the party all the good drinks and nibbles were finished ages ago. All my personal email is with Google, I use Google reader, documents on Google Docs and Evernote, and now I have my Amazon CloudDrive and Cloud Player. So do I need that much on-phone storage any more? For this first week I decided not to sync more than I had to, it worked fine, my email, contacts, docs and more connected with my phone easily and seamlessly, I listen to my music and ebooks on the bus streaming from Amazon, read using my Kindle reader and while some of those things wrote to the phone it's really now just a cache, not long-term storage. This really does feel much more like a cloud phone than the iPhone will for a while.

So, I started to think about my next bug-bear for usability, consistency, one of the nice things about the Mac as a platform which we expected to transition to iOS was that Apple's design guidelines and aesthetic have led to a consistency of look and behavior across apps on the platform, which directly leads to the easy adoption of the platform by new users, when you learn the basics most apps seem pretty familiar. Now, I know there are exceptions to this, in fact Apple has some of the most egregious exceptions (can anyone explain the UI on Aperture?) its far more consistent than the alternatives. The interesting thing is that this consistency didn't really transfer to iOS, in many cases you feel that they did try and provide a common way to do things, you imagine Apple designers asking "OK, we need to figure out how users should navigate this feature. So, how would Steve want to do it? ... Great, then well do it that way and the users will just have to behave like Steve.". However, it really didn't follow through, here are two annoying examples.

Settings, so where do I configure my app? some apps like the stocks and weather apps use the little "i" in a circle icon that you click to make changes which was taken from the Mac Dashboard. Some have settings buttons and pages in the app such as Twitter which I used a lot and finally some contributed settings into the "Settings" app. While the first has the advantage of locality, the last has the advantage of consistency, the very existence of three different models is confusing.

Delete, OK, this sounds really anal, but its a good indicator on how the user experience team thinks about common and repeatable tasks. In many apps I need to delete items, so how many gestures can the iOS team come up with to delete things from a list? Well, the Mail app does two things, the stocks and weather apps do something different, the Text app is slightly different and with this guidance apps have come up with even more schemes to do a simple thing like delete.

So, how did Android stack up? Well, first let's re-imagine our design conversation from earlier, "OK, we need to figure out how users should navigate the home page?". For Android you get the feeling the response was "Well the users kind of didn't know, they thought you might swipe from side to side, some said it would be cool if you could tilt the phone to make the icons slide about, others suggested that you could hold down your finger and speed scroll, others liked the on-screen organization but wanted a place to see all their apps alphabetized." . So the response? "Cool, lets do all of them!", So Android can be a little confusing at times when you put your finger on the screen expecting behavior #1 but because you left your finger in contact slightly longer than normal you invoke behavior #2. But I guess it does allow you to pick a style you like, after all with the iPhone if you dont think like Steve the experience can get very frustrating after a while.

So how did the Android do on the Settings and Delete tests? Well on settings it did well, Android clearly separates out two classes of settings, system settings and app settings. Systems settings are accessed via a stand-alone app and pretty much cover what you'd expect, including your accounts which then control email access and so forth. The addition of the dedicated menu button really helps, apps now have a consistent way to present a menu to users, including a settings option. Settings windows/views have a consistent look and feel and are on the most part well layed out. I changed font sizes on a number of app, adjusted sync settings in email and calendars and all those first-week tweaks without any fuss or confusion (except changing the default font on the SMS app which apparently is not possible).

As for delete, Android is no worse, but no better, Most apps support "press long enough on an item and get a menu, which include delete" behavior which is great, for one-by-one action. The gmail app and others leave a checkbox on the screen next to each item, when you start selecting items you get a delete button (or move or other action), Messages, Mail and others have a Delete menu option that then adds check boxes to the display although Mail has them on the left, Messages and Files put them on the right with a "select all" option.

So, bottom line, did I give up anything? not really, I got a better screen, a much faster phone, I like the ability to manage the home screen(s) with icons and widgets and all my cloud services (and yes, Amazon Video on Demand works just fine so was watching TV on the bus home too). Do I miss the comfort of the iPhone as a device i knew, and its intimate connection to my Mac, well yes, but right now am definitely impressed with both Android and the Galaxy.

Thursday, September 8, 2011

Google Code + Git = Fail

Well, I wanted to start a new project on Google Code and typically have used SVN before but as I'm using Git more and more on my machine for personal work it seemed a great chance to use their new Git support. So, I created a new project, set Git as my version control and went straight to the command line and cloned my new project.

% git clone https://myusername@code.google.com/p/myproject/ 
Cloning into myproject...
warning: You appear to have cloned an empty repository.

Well, that all seemed good, so I added a file, committed locally and tried a push ...

% vi LICENSE
% git add LICENSE 
% git commit
[master (root-commit) 4551a7d] Added file
 1 files changed, 1 insertions(+), 0 deletions(-)
 create mode 100644 LICENSE
% git push
fatal: https://code.google.com/p/myproject/info/refs not found: did you run git update-server-info on the server?

Oops, so what did that mean? I went to the web page for my project, to the Source tab and browsed the source and clicked on the "git" node; the Filename pane now displays "Error retrieving directory contents." which seems odd, after all there's nothing there but it should at least be able to display an empty project. So a little search suggested that there had been an issue with Google Code and projects that end with a trailing "/" when you cloned them, so I started all over but cloned https://myusername@code.google.com/p/myproject instead. The result? Absolutely the same.

So, I wondered, Google now recommends that you use the .netrc to store your login details and use a URL that doesn't include your email address. So, I added my details into .netrc and tried the whole process again. And this time? Absolutely the same result. So as far as I can tell, from my Mac, using Git 1.7.5.4, I cannot clone and push changes back to Google code :-(

I went to file a ticket and discovered someone else already had so I added some comments and now I guess I wait. You can follow along with the support ticket if you wish.

Monday, July 25, 2011

Add CSS to Dojo Widgets

Many Dojo widgets have custom CSS and you have to remember to include them either in your application HTML pages or in your application CSS file. The problem is that when we add such a widget into our application code we have to figure out whether it has it's own CSS we have to weave in. Usually we find this when we load the page and it looks a mess. So, a widget declares it's templates, it declares the i18n bundles it uses, why doesn't it also include the CSS as well?

What I want to be able to do is something like the following, add a require call to declare my stylesheet and have it dynamically added to the page.

dojo.provide('foo.bar.MyWidget');

dojo.require('dijit._Widget');
dojo.require('dijit._Contained');
dojo.require('dojox.dtl._Templated');

dojo.requireLocalization('foo.bar', 'MyWidget');

dojo_requireStylesheet('foo.bar', 'MyWidget', 'myWidgetClass');

dojo.declare('foo.bar.MyWidget', [dijit._Widget, dijit._Contained, dojox.dtl._Templated], {

    ...

So, I created the dojo.requireStylesheet method shown below. Right
now it's pretty simple, it takes a module name and the base name for the CSS stylesheet so that the result looks much like "requireLocalization". The additional parameter is a class name from the target CSS that we know exists, this allows the method to avoid attempting to load the stylesheet if it has already been loaded (either in code or in the HTML).

dojo.loadedStylesheets = {};
dojo.requireStylesheet = function(/*String*/moduleName, /*String*/styleSheetName, /*String*/expectedClass) {
 // summary:
 //  This call will loaded a CSS stylesheet dynamically, by adding a new
 //  link DOM node to the page head. The caller specifies the name of 
 //  the module owning the CSS, and the call assumes that all CSS files
 //  exist within a "resources" sub directory. The stylesheet name is
 //  again the base name only and the call will append ".css". The 
 //  purpose of this is to allow a requireStylesheet call to look and
 //  feel like existing calls such as dojo.requireLocalization. Note
 //  that best effort will be made to ensure that the same stylesheet
 //  is not loaded more than once and the caller can provide a class
 //  name from the target stylesheet that is expected to be loaded, if
 //  this class is found then it is assumed that the stylesheet has
 //  been loaded elsewhere.
 // moduleName:
 //  the base module name owning the stylesheet. All stylesheets live in
 //  a subdirectory named "resources" of the owning module.
 // styleSheetName:
 //  the base name, without the ".css" suffix, of the stylesheet.
 // expectedClass:
 //  a class in the target stylesheet to be used to test whether it 
 //  has already been loaded. Note that if this is undefined the call
 //  will always try and and load the stylesheet.
 //
 function onClassFound(classItem) {
  if (classItem === null || classItem === undefined) {
   var url = dojo.moduleUrl(moduleName + '.resources', styleSheetName + '.css');  
   var link = dojo.create('link', {
     type: 'text/css',
     rel:  'stylesheet',
     href: url},
        dojo.doc.getElementsByTagName('head')[0]);
   dojo_loadedStylesheets[styleSheetName] = true;
  }
 }
 if (moduleName !== undefined && styleSheetName !== undefined) {
  if (dojo_loadedStylesheets[styleSheetName] === undefined) {
   if (expectedClass !== undefined) {
    var classStore = new dojox.data.CssClassStore();
    classStore.fetchItemByIdentity({identifier: expectedClass, onItem: onClassFound});
   } else {
    onClassFound(undefined); /* force loading */
   }
  }
 }
}

Now your widgets have the following structure, and while a lot of Dojo widgets and user widgets use the "resources" directory the code above makes that a requirement in the same way that dojo.i18n requires the "nls" directory.

/ foo.bar
  +-- nls
      +-- Mywidget.js
  +-- resources
      +-- Mywidget.css
  +-- templates
      +-- Mywidget.html
  +-- Mywidget.js

TODO: one issue with this is that it is not "theme" aware, the issue today is that the notion of theme is encoded in the CSS (and image, and other) resource URLs. A common approach to the identification of the current theme and encoding the theme in the path would be really valuable - if anyone from Dojo is listening :-)

Thursday, January 20, 2011

Some OSGi JAX-RS tweaking

In my last post I managed to get my Felix OSGi run time to host some JAX-RS resources, but noted one issue then and had one issue since that were worth a little effort to understand.

Nasty JAX-B attributes

In the last post I noted that to get my data to serialize correctly I had to add JAX-B attributes to my domain classes to ensure that the objects were correctly serialized.

import javax.xml.bind.annotation.XmlRootElement;

@XmlRootElement
public class PlatformProperties {
    ...

First, I only ever product JSON so the whole "Xml" prefix for each attribute is kind of annoying, and secondly I'd just rather not have to provide any attributes at all if the runtime can figure out a sensible serialization for me. Now, the JAX-RS runtime has a provision for custom serialization, the notion of a MessageBodyWriter which can then be added to the list of classes for an application. The Jackson project does provide a nice implementation that can read/write JSON and doesn't need all those attributes - org.codehaus.jackson.jaxrs.JacksonJsonProvider.

The issue is that the current implementation of the OSGi Extender takes a list of classes from the bundle manifest to create the application. Because the list of classes is provided by the bundle the extender uses the bundle class loader to create the actual list of classes to provide to Jersey. Unfortunately the JSON provider isn't in my bundle so we can't list it in the JAX-RS-Classes manifest property as the class loader will fail. To get around this, I created the following simple subclass in my own bundle.

import org.codehaus.jackson.jaxrs.JacksonJsonProvider;

public class JsonProvider extends JacksonJsonProvider {

}

I then added this to the list of classes in JAX-RS-Classes and was able to serialize my domain classes nicely - sans attributes.

OSGi service can't be a resource

My next issue was that I had a nice OSGi service (in fact using declarative services) and some of it's methods would be nice to expose to turn the service effectively into a resource. So, starting with the obvious I just added my JAX-RS attributes to the implementation class for my service - which almost worked. My service started, and like a good citizen I had used a declarative service reference to inject the LogService into my service. However, when I used my browser to pull data from my resource I was rather surprised to get a NullPointerException as my service didn't even do anything yet, here it is.

@Path("/things")
public class MyServiceImpl implements MyService {

    private LogService log = null;

    @Override
    @GET
    @Produces("application/json")
    public List getIds() {
        log.log(LogService.LOG_INFO, "Retrieving list of IDs.");
        List idList = new LinkedList();
        return idList;
    }

    public void bindLog(LogService log)  {
        this.log = log;
    }
 
    public void unbindLog(LogService log) {
        this.log = null;
    }
}

It turns out the NPE was the first line of the getIds method, the call to the log service. Basically the OSGi runtime created a singleton object for the service and bound an instance of the log to my service correctly. However, because the default behavior for the OSGi extender is to pass in a list of classes to Jersey via the Application class then Jersey will construct a new copy of MyServiceImpl as it needs to, one that hasn't been provided with a log service so the log is null and ... boom.

So, now I have to separate out my resource from my service, a shame but not necessarily a problem. Now, it would be possible to take the instance of my service and pass it to Jersey as a singleton, but would make the implementation of the extender much more complex.