After raising this topic in PLANETS and us evangelizing at the OR2010 it was great to see quite a bit of practical approach to software archiving at Archives New Zealand.
When working in the area of long-term access to digital objects (and when your objects are anything more complex than your basic image file) a software library containing the original creating and rendering applications becomes inevitable. Many digital objects are not properly accessible with any modern software packages such as WordPerfect, WordStar or different types of database files. The first step when accesing older digital objects is to get an idea of their intellectual value or relevance by opening, for example, text documents in some editor or text processor. This may be of some use but the rendering results are (far) from the original: The number of pages can change significantly, garbage is added to the page content, footnotes are moved to the end of the file, formatting is completely lost. If this was what memory institutions served up as authentic versions of older files, it would leave the reader with a number of serious doubts regarding the authenticity as it is not easily provable that the original document wasn’t altered in a malicious way, that all original content is there and that significant meaning wasn’t lost.
This is much more serious for database files. It is sometimes possible to load them to a text or hexadecimal editor. But the displayed results depend heavily on the content of the database cells and often are not easy to interpret. And, in most cases all the relational information of the data contained is lost completely. Those are only two examples tested at the National Archive of New Zealand on born digital content from their holdings and holdings of contributing agencies.
Thus, for a number of reasons the Archive is investigating the feasibility of setting up an Software Library/Archive of relevant applications and has begun to acquire copies of relevant software while the wider implications of such a library (such as legal provisions) are being explored. As many of those applications could not be reliably executed on todays operating systems the relevant operating systems are archived with them too. Beside this certain additional software components are needed like the hardware drivers to install the operating systems to original reference hardware or to emulators like QEMU, VMware or VirtualBox. More items might be required in the library if the relevant digital objects of interest are compressed or combined into a package. E.g. the Archive holdings contained 39 Apple HFS formatted disks which itself contained the relevant material compressed with StuffIt, a popular but now nearly extinct archival format.
Nevertheless, there are a number of practical and legal issues are still to be solved.

Comments
Software Archiving
This is definitely a complex area.
At Archives NZ we are investigating the best way to catalogue/document the software so that in future automated emulation could be a achieved. The trouble is that there are so many interdependencies between applications/programs that they can be very complex to document.
I am following the work of the Software Sustainability institute quite closely with the hope that a useful ontology/metadata framework will come out of their work.
From a legal standpoint we are being as careful as possible, ensuring we have multiple copies of software when we want to emulate one copy and run another on original hardware for example. Clearly if this is to become business as usual some changes may have to be made to the the law or software providers will have to provide mechanisms for software preservation and continued access to be done on a large scale.
Susan Corbett wrote an interesting paper on the legal situation in New Zealand regarding preserving video games, many of the finding apply across other software also:
Digital Heritage: Legal Barriers to Conserving New Zealand’s Early Video Games – Susan Corbett Corbett S Digital Heritage: Legal Barriers to Conserving New Zealand’s Early Video Games, March 2007, vol 13, No 5, New Zealand Business Law Quarterly, 48-71
Unfortunately the legal situation is different in different parts of the world and so the issue is going to have to be tackled many times over.
Euan
Submitted by Euan Cochrane on 7 March 2011 – 9:00pm Permalink
Dumping the Old Media
The situation for X86 software is still relatively easy. The usual media are floppy disks and optical media. Archives have so far been able to source only one old 5,25″ PC style floppy disk drive that actually works, though. … and we got two of those in Freiburg in working order.
In a further step the contained media was dumped to image files onto hard disk of the digital software archive machine in order to have a backup copy (legally allowed in New Zealand) and be able to access the content at all. Fortunately this process went comparably well taken the age of the media into consideration. Even more than 20 year old 5,25″ disks were well readable. Nevertheless, the density of the media was pretty low with 360kByte. In some cases
dd_rescuehelped to dump the few disks producing reading errors.Another interesting finding were the differences found between software production releases of the same version number. E.g. Word Perfect 5.1 as an update and as standard release differed due to silent updates implemented by the vendor.
At the moment the software library in the present state ignores the Apple Macintosh and other platforms legacy. In the holdings survey at the Archive, items surfaced which might be not covered by the software collection that has been compiled so far, especially if this is extended on the holdings of other contributing public sector agencies such as scientific institutions who have been using computers for over 40 years.
Submitted by Dirk von Suchodoletz on 8 March 2011 – 11:31am Permalink
Internet Archive CLASP project
I just wanted to note the existence of the Internet Archive’s CLASP project, which aims to archive software. This effort is of particular importance because they have secured an exception to DCMA legislation which helps avoid many of the legal issues that are preventing progress elsewhere.
Submitted by Andy Jackson on 8 March 2011 – 4:48pm Permalink