My current research projects are documented in detail in my open Research Notebook:
Real words :: Imagined tweets
Have you ever wondered how the cut and thrust of parliaments past might translate to the world of social media? Wonder no longer, for here you can explore interjections in the Australian parliament from 1901 to 1980, reimagined as tweets. You might even find some emoji…
Closed Access dataset
2017 update! Complete dataset of records held by the National Archives of Australia that had the access status of ‘closed’ (withheld from public access) on 9 January 2017.
Demonstration code to harvest the Department of Foreign Affairs and Trade’s collection of historical documents and extract some metadata. The harvested documents are available in Markdown format and can be explored through a simple website.
People of Australia
@people_aus is a Twitter bot sharing random names drawn from late 19th and early 20th century naturalisation records held by the National Archives of Australia. Many names. Many cultures. These are the people of Australia.
RecordSearch Series Harvests
Code to harvest the metadata and digitised images of all items in a series from the National Archives of Australia. Data from an assortment of harvested series are available as CSV files.
Show Redactions userscript
Code for inserting details of redacted files into RecordSearch results.
Code used for the extraction of redactions and other experiments with digitised ASIO files.
Redactions extracted from ASIO surveillance records in National Archives of Australia Series A6119, <https://dx.doi.org/10.6084/m9.figshare.4101765.v1>
Non redactions dataset
False positives (non-redactions) extracted from ASIO surveillance records in National Archives of Australia Series A6119, <https://dx.doi.org/10.6084/m9.figshare.4104651.v1>
Invisible Australians browser
Updated code and website providing an experimental browser for digitised records from the National Archives of Australia relating to the administration of the White Australia Policy. Now includes a landscape view for exploring records by their orientation.
Closed Access harvester
Updated code for harvesting and analysing records from the National Archives of Australia with the access status of ‘closed’.
Closed Access dataset
Complete dataset of records held by the National Archives of Australia that had the access status of ‘closed’ (withheld from public access) on 1 January 2016.
Closed Access website
Public web interface for the exploration, analysis, and visualisation of ‘closed’ records in the National Archives of Australia.
Commonwealth Hansard XML repository
A repository of the (almost) complete proceedings of the Commonwealth House of Representatives and Senate from 1901 to 1980. It comprises several gigabytes of XML-formatted files harvested from the ParlInfo database.
A public website that presents the proceedings of the Commonwealth House of Representatives and Senate from 1901 to 1980 in a form optimised for browsing and reading. It includes indexes to people and legislation, integrated tools for text analysis and annotation, and documentation.
DIY Headline Roulette
Code and documentation that make it easy for anyone to create their own simple game using Trove’s digitised newspapers.
Radio National program data
Updated dataset of programs broadcast on Radio National from 2000 to 2016, harvested from Trove.
PMs Transcripts repository
Repository of more than 20,000 XML transcripts of speeches by Australian Prime Ministers harvested from the PMs Transcripts site.
UMA Ellis Photos
Repository of data and images from a collection of political photos by John Ellis held by the University of Melbourne Archives. Harvested using the Trove API.