NISSAT-NCSI Workshop on

Developing Digital Libraries using Open Source Software

15.4.2002 to 20.4.2002


Free /Open Source Software for Library and Information Management

Compiled by : Suvarsha Walters and Madhuresh Singal, National Centre for Science Information, IISc, Bangalore.


Disclaimer: This compilation is non-exhaustive. We do not endorse the accuracy or reliability of any of the information provided here.

If you are interested in more information on Free / Open Source Software Visit
OSS4lib at www.oss4lib.org
SALIS (Software And Licensing Information System) at http://salis.ece.iisc.ernet.in
Freshmeat at www.freshmeat.net
Sourceforge at www.sourceforge.net

Compiled list of Software
Automated Classification and Thesarus Construction
Content Management / Knowledge Management
Database Systems
Indexers, Information Retrieval and Digital Library Packages
Library Automation
Miscellaneous

Codes :    : Freeware | : Open Source

1
Automated Classification and Thesarus Construction
  Software Title Operating Systems Description
ExtPhr32 Windows 32 Extracts every word and every phrase up to a certain length that occurs at least a minimum number of times in a source text and that does not start or end with a stopword.
NEPHIS32 Windows 32 Incorporates the classic NEPHIS string indexing system, with some additional formatting features, including production of indexes in HTML format.
TexNet32 Windows 32 Assists in the writing of abstracts and other short summaries. Includes word and phrase extraction and various other capabilities.
TheW32 Windows 32 Assists in creating, modifying, and printing out a thesaurus. Allows users to define their own link types and report formats. Includes a template with definitions of the standard SN, USE, UF, BT, NT, and RT relation types and alphabetical, short, and tree report formats.
XRefHT3 Windows 32 Assists in human indexing of Web documents. Produces index displays in HTML form, including links to Web documents. Allows for automatic target extraction, target and link insertion, and production of tables of contents in HTML files and insertion of certain other HTML codes in plain text files.
2
Content Management / Knowledge Management
  Software Title Operating Systems Description
Apollo CMS Unix like systems Apollo CMS is a content management system written in Python, using PostgreSQL as a backend. It is written in a modular way allowing for new types of content, and new database backends
Apache::PageKit
Unix like systems PageKit is a modular application framework that uses HTML::Template and XML to separate the design from the content. It includes session management, authentication, form validation, co-branding, and a content management system.
SPIN CMS Unix like systems SPIN CMS is a basic and simple content management system for newbies and those wanting something up and running quickly. It can also store images and other binary files. The admin mode allows one to add, view, edit, and delete content arranged in multiple categories. Various output options are available for 'public' pages.
Based on MySQL and PHP 4.
Baseit Unix like systems basit is a Web site content management system developed using PHP. The package provides a Web-based user interface to configure all the attributes of the Web site and add/edit/delete articles. A full usage manual including setup, administration, and maintenance is provided. The package doesn't need any database support on the Web server.
BaseML Unix like systems BaseML is a tool/compiler to manage and construct Web sites. To create a Web site, you provide BaseML with a set of "source" XML files that contain the styling and contents of your web site. The source code format is a mix between XML/XHTML components and procedural programming.
AStA-CMS Unix like systems AStA-CMS is a simple content managment system (CMS) which allows you to manage the contents of your Web site in a well-structured manner, based on an hierarchiecal category tree. It also allows you to store files and images in a database. It features multi-user functionality and the ability to manage articles, files, and images easily. It is based on templates, and thus provides a proper separation of PHP code and visual design.
asCMS   asCMS is an easy-to-use, powerful PHP-based content management system.
Ascend Unix, Windows, MacOS Ascend is an advanced profesional Web content manager. It supports multiple OSes, multiple database connections, multiple languages, multiple presentation schemes, multiple browsers, XLM integration, a Java editor, advanced security management for both users and administrators, integrated CRM, and CVS integration. It can also use CGI scripts or ascend-specific plugins to extend the software for tasks such as e-commerce, portals, intranets, Unix system administration, and editorial systems.
Arsdigita CMS OS Independent Arsdigita CMS is a powerful content management system. It has a task list for production staff to track their assignments and the status of their current work items; a site map browser to view and organize pages and content items and determine access control to branches of the site; a standard interface for creating, editing, approving, and deploying content items; a template manager for creating, editing, and organizing presentation templates and related assets; a metadata manager for viewing and defining content types and associations; a category browser for managing a hierarchy of subject headings that may be applied to content items; and administrative and management tools for creating and editing user attributes and tracking global workflow statistics.
ArsDigita Community System
Windows, Linux, SunOS/Solaris The ACS (ArsDigita Community System) is a sophisticated, scalable platform for Web development, with support for content management, commerce, workflow, messaging, session management, permissions, bulletin boards, and many other applications.
Jake Unix like systems jake (Jointly Administered Knowledge Environment) is a reference source which makes finding, managing, and linking online journals and journal articles easier for students, researchers, and librarians. jake does this by managing online resource metadata with a database union list, title authority control, linking tools, and a local holdings layer.
Grub Unix like Systems Grub provides a distributed crawling client, which is used to create an infrastructure (database + volunteers) that will eventually provide URL update status information for nearly every web page on the Internet. Grub's distributed crawler network will enable websites, content providers, and individuals to notify others that changes have occurred in their content, all in real time.
Zope Windows9x, NT, Unix like systems Zope is the leading Open Source web application server. Zope enables teams to collaborate in the creation and management of dynamic web-based business applications such as intranets and portals. Zope makes it easy to build features such as site search, news, personalization, and e-commerce into your web applications.
Nesstar Window NT Server/Workstation with Service Pack 4 or later applied and under Windows 95/98/2000.
NESSTAR is an infrastructure for data dissemination via the Internet. Nesstar Explorer offers an end user interface for searching, analysing and downloading data and documentation. Nesstar Server offers tools and resources for making data and documentation available via the Internet.
3
Database Systems
  Software Title Operating Systems Description

XBase

GNU Linux, Windows 98, Windows 95, Windows 2000
Xbase DBMS is a collection of specifications, programs, utilities and a C++ class library for manipulating Xbase type datafiles and indices which has been developed and produced by many people from around the world using the internet. This project was formerly known as Xbase for Linux, but as I have recieved input from several people who are compiling this on platforms other than Linux, I have renamed it to Xbase DBMS. The main development of this library however, remains on the Linux platform utilizing the GCC public domain C/C++ compiler.

Haccess 97

Windows
A reverse engineer the Microsoft access ( MDB / jet) database format, and provide full documentation and cross platform ansi C++ read-write libs to support SQL, Cursors, ODBC drivers, GUI's and data-recovery.

genSQL

GNU Linux, Unix
genSQL is a generic SQL database UI, written in C++ using the GNOME--/GTK-- widget sets. Its aim is to provide a complete and easy-to-use interface to various SQL databases.To create a generic SQL GUI for Linux/UNIX platforms.Opening and closing multiple databases with different types, Viewing the list of tables in the databases, Dropping tables, Viewing table data Loading and unloading GQL database drivers.

MDBMS

GNU Linux
MDBMS is a new relational database management system. It is very clean and small, yet powerful enough to do complex database work. MDBMS is currently available in source and binary form, for Linux and BSD systems. MDBMS is known to work correctly on Alpha-linux and Sun Solaris machines.

SQLite

Unix-like systems, Windows
SQLite is a C library that implements an SQL database engine. A large subset of SQL92 is supported. A complete database is stored in a single disk file. The API is designed for convenience and ease of use. Applications that link against SQLite can enjoy the power and flexiblity of an SQL database without the administrative hassles of supporting a separate database server. Because it omits the client-server interaction overhead and writes directly to disk, SQLite is also faster than the big database servers for most operations. In addition to the C library, the SQLite distribution includes a command-line tool for interacting with SQLite databases and SQLite bindings for Tcl/Tk.

Mimer SQL

GNU Linux, Unix, Windows NT, Windows 98, Windows 95, Windows 2000
Mimer SQL is an SQL-based, high-performance RDBMS, targeted at mission-critical client/server environments. It offers scalable performance (including multi-processor support), and is available in all major Unix (including Linux), Windows, and OpenVMS platforms. It offers Optimistic Concurrency Control as the transaction handling method, instead of the more commonly used locking-type.

WebMySQL

GNU Linux, Unix
WebMySQL is a Web interface to a MySQL database server. It supports the following SQL queries: SELECT, DESCRIBE, JOIN, and SHOW, and advanced query options such as LIMIT, DISTINCT, and GROUP. It has a fully customisable look and feel, and outputs either HTML results or a downloadable CSV file.

JCDatabase

GNU Linux, Unix, Windows NT, Windows 98, Windows 2000
JCDatabase, Direct File Database Management System has the following advantages over JDBC: 1. JCDatabase does not require you to run a database server or configure your system to provide access to a local database and; 2. JCDatabase can be very conveniently be deployed as part of a traditional client-side application using direct file access.

Maccess

GNU Linux
Maccess helps to analyse data in databases; it is a Web interface to an SQL server. Its main features are user friendly query creation for SQL server; selective, grouped, and crosstab queries; various outputs (XML, HTML, TXT, graphs, and Europe map); data ungrouping in grouped queries; creation of Web links and definition of CSS styles for all values in output; ability to format numbers, dates, and times; and input and data update.

GigaBase

GNU Linux, Unix, Windows NT, FreeBSD, OS/2, Windows 98, Solaris, MAC OS, AIX
GigaBASE is object-relational embedded database engine for C++ applications. It provides SQL-like query language, smart C++ interface (loading objects instead of tupples), transaction based on shadowing page algorithm (no separate log file and very fast)

Dbreport

GNU Linux, Unix
dbreport creates a PostScript report of a table or view within a database. The database is accessed via unixODBC, and the format for the report is specified by an XML file.

NetSQL

GNU Linux, Solaris
NetSQL is a light and robust SQL database server for Linux and Sun Solaris. Because it can be configured on a virtual web server, it enables every Webmaster with a virtual web site to host database applications. It comes with a very intuitive web application builder, DataDiplomat, which not only has a visual form editor, CGI script generator and installer, but also provides necessary ODBC database support. With DataDiplomat application builder, you can create searchable web sites with real-time updating capability without programming.

ACDK SQL

Windows NT, Solaris, Windows, Windows 2000
ACDK is a platform-independent library for generating distributed server-based components and applications. acdk_sql contains the driver model for accessing SQL databases, which is quite similar to the JDBC interface.

Bonddb

Unix like systems
Bonddb is a object oriented wrapper for postgresql SQL. It's a fast data abstraction layer written in C for C/C++ applications to allow easy access to class objects. Bonddb is not a stand alone object orientated database, but works on top of SQL so you still get a solid proven backend with the benefits of things like prescient objects.

GOODS

Windows
GOODS is an object-oriented fully distributed database management system using an active client model. Its multithreaded high-performance database server is language and application independent. The client application interface is built using meta-object protocol and provides transparent persistency for normal programming languages, and the aspect-oriented programming approach used for implementing object access and synchronization policies makes it possible to define various database access strategies fitting requirements of concrete application and separate them from application code itself.

KDBMS

Unix-like systems
KDBMS is a database management system with its own interpreted scripting language supporting WEB/WAP/IVR. It also supports multiprocessing, shared memory, and many other things.

db4o (Database for Objects)

Platform Independent
db4o (database for objects) is a client/server Java object database. It features transactions, automatic class recognition, query-by-example, callback methods, memory files, cascade-on-delete, multi-threaded access, and Java Webstart and servlet support.

UML2SQL

Platform Independent
UML2SQL is a Java tool which synchronizes SQL databases with UML diagrams. UML class diagrams can thus serve as authoritative definitions of database catalogs. UML2SQL can be used together with a UML design tool which is capable of producing XMI. For the most generic operations, only a JDBC driver is required for the SQL database. Vendor specific DDL commands for unsupported databases can be easily implemented by extending the DatabaseAdapter class.

Orac DB Tool

Platform Independent
Orac is a GUI tool for Database Administrators written in Perl, Tk, and DBI. It organizes many useful DBA scripts into a nice graphical interface, for collecting statistics on your database, or diagnosing problems. It currently supports Oracle, Sybase, and Informix. Richard Sutherland\

FutureSQL Tool

GNU Linux, Unix
FutureSQL is a Rapid Application Development web database administration tool written in Perl. FutureSQL allows one to easily setup config files to view, edit, delete and otherwise process records from a MySQL database. It uses a data dictionary, configuration files and html templates, and allows \"pre-processing\" and \"post-processing\" on both fields, records and operations. It allows multiple views and operations on a data set, including the use of joined tables for queries and reports. A demo application with most of the features is included.

Qddb

GNU Linux, Unix, FreeBSD, Solaris, HPUX, Sun OS, Open BSD
Qddb is powerful database creation software. Using Qddb, you can quickly and easily create database applications. nxqddb allows you to create a complete database application in less than five minutes, and Hx lets you manage database information over the web through a CGI interface. Larger applications can also be developed using Fx, a powerful TCL/Tk toolkit.

MaVerick - MultiValue Database Management System

Platform Independent
MaVerick is a MultiValue database management system.MultiValue databases can be relational, but their method of storing data is much more flexible than the 2 dimensional table you normally associate with relational databases, i.e., you can store multiple fields within a field. On top of which, they are extremely fast in operation.

XDBM

Windows
XDBM is a database designed specifically to handle XML data. It offers greater speed (since the data is stored in a preparsed format), lower memory usage (since only those parts of the file needed are loaded into memory) and faster searching for XML elements. XDBM can also parse and store plain text XML files. XDBM aims to be as close to the W3\

LocalSQL

GNU Linux, Windows
LocalSQL provides a library and command line tool for direct invocation and processing of SQL statements on local databases. A subset of SQL92 is implemented, notably 'INSERT UPDATE DELETE SELECT CREATE TABLE CREATE INDEX DROP TABLE'.

RedBase Pure Java RDBMS

Platform Independent
RedBase Pure Java RDBMS is a 100% Pure Java database, with an ultra-compact footprint designed for rapidly developing applications that extend enterprise data management capabilities, to mobile and embedded devices. It is ideal for mobile, wireless, and embedded applications, and it delivers essential relational database functionality in a small footprint, while providing flexible data access, and the familiar feel through entry SQL-92 compliance, and JDBC access.

Scientific Image Database (SIDB)

GNU Linux
SIDB (Scientific Image Database) is a Web-driven database for (scientific) images. Entry of image meta-data is facilitated through the use of user-definable templates. Users have complete control over who else may see the submitted image and data. SIDB offers various views on the actual image data. Thumbnail plus meta-data can be printed, and multiple thumbnails can be combined in galleries. A freeware version of the Huygens software (http://www.svi.nl) can be used to generate projections from 3-D images (most confocal microscope image formats are supported), as well as MPEG movies, showing the individual layers from the 3-D images.

Ozone Java ODBMS

Platform Independent
ozone is a transaction-based, object-oriented database management system completely implemented in Java and distributed under an open-source license. It aims to evolve a technology that allows developers to build pure object-oriented, Java database applications. ozone also contains a full-featured XML API that allows you to use it as an XML repository.

Lago Database

GNU Linux, FreeBSD, Windows
Lago is a portable (Linux/Windows), multi-threaded database written in C++.
4
Indexers, Information Retrival and Digital Library Packages
  Software Title Operating Systems Description

FreeWAIS-sf

Unix-like systems
FreeWAIS is a text retrieval system based on the vector space model. freeWAIS-sf is an extension of freeWAIS, developed at the computer science department of the University of Dortmund. sf means structured fields and indicates the main improvement of freeWAIS-sf.The SF suffix in the software name stands for structured fields an indexing and search feature which distinguishes this software from its predecessors.

Greenstone Digital Library Software:

Unix, Windows NT, Windows 98, Windows 95
Greenstone is a suite of software which has the ability to serve digital library collections and build new collections. It provides a new way of organizing information and publishing it on the Internet or on CD-ROM. Greenstone is produced by the New Zealand Digital Library Project at the University of Waikato, and distributed in cooperation with UNESCO and the Humanities Library Project.

Harvest Web Indexer

Unix like Systems
The Harvest indexer offers a distributed solution to the problems of indexing data made available on the web. With each web server running a local Gatherer feeding into a central Broker many of the problems of web crawling are avoided. The Harvest Indexer can fetch and index data made available by HTTP, Gopher, FTP, or NNTP. It has summarisers capable of indexing data in a wide variety of file formats.

ht://Dig

GNU Linux, Solaris, HPUX, Sun OS, IRIX
ht://Dig system is a complete indexing and searching system for a domain or intranet. It has the ability to search through many servers on a network by acting as a WWW browser.

Perlfect

GNU Linux, Unix-like systems, Windows
Perlfect Search is a customizable and effective site indexing/searching suite. It comes as a pair of disctinct scripts. The indexer, that automatically scans and indexes a web site, and the search engine, a cgi script that serves search queries for keywords over the index, and displays results pages in html, in a standard format including relevance ranking for each matching document.

Swish-E

Unix-like systems, DOS, Windows NT
The Simple Web Indexing System for Humans-Enhanced is a system for indexing collections of Web pages or other text files. The AND, OR, and NOT operators are supported, words can be truncated using *, and searches can be limited to particular fields or to certain HTML tags. Index files consist of only one file, so they are portable and can be easily maintained.

Isearch

GNU Linux, Unix, FreeBSD, Solaris, HPUX, IRIX, Ultrix
Isearch is an open source software package for indexing and searching text documents. It supports full text and field based searching, relevance ranked results, and Boolean queries, and it can index many kinds of documents, including HTML, mail folders, list digests, and text with SGML-style mark up. Additional document types can be added as C++ modules. Isearch also includes CGI tools to provide a web interface for searching.

mnoGoSearch

GNU Linux, Unix, Unix-like systems, FreeBSD, Solaris, HPUX, Sun OS, Open BSD, Windows
Formerly known as UdmSearch, mnoGoSearch is a full-featured SQL based web search engine. It has number of unique features, which make it capable of wide range of application - from search within your site to a specialized search system.

Webinator

GNU Linux, Windows NT, FreeBSD, Solaris, HPUX, IRIX, AIX, Windows 2000, DEC UNIX, Tru64
The Webinator is a sophisticated Web index and retrieval package. It allows the Website administrator to create and provide a high quality retrieval interface to collections of web documents

Zebra

Unix-like systems, Windows
Zebra is a fielded free-text indexing and retrieval engine with a Z39.50 frontend. You can use any compatible, commercial or freeware Z39.50 client to access data stored in Zebra.

Amberfish

Unix like systems
Amberfish™ is Etymon's new high performance text retrieval software.

Locus

Linux
Locus is a fulltext database search engine
Cheshire II Unix like systems The Cheshire II project is developing a next-generation online catalog and full-text information retrieval system using advanced IR techniques. This system is being deployed in a working library environment and its use and acceptance by local library patrons and remote network users are being evaluated. The Cheshire II system was designed to overcome twin problems of topical searching in online catalogs, search failure and information overload as well as to provide a bridge between the purely bibliographic realm of previous generations of online catalogs and the rapidly expanding realm of full-text and multimedia information resources. The system incorporates a client/server architecture with implementations of current information retrieval standards including Z39.50 and SGML and XML.

OCLC Site Search Windows NT Server, Unix like systems The OCLC SiteSearch™ software provides a comprehensive solution for managing distributed library information resources in a World Wide Web environment. It offers tools that integrate electronic resources under one web interface, provide flexible access to resources, and build text and image databases locally.
Ganesha Digital Library Windows 98, NT, 2000 Features: Distributed Knowledge Management,Centralized Knowledge Distributiont,
Online Member Registration, Roaming Membership, Searching, Category-Based Organization, Upload Metadata and Files, Personal Directories, Review Forum, Access Restriction, Image Thumbnail, Knowledge Organization, News, Synchronization, Member and Group Administration, Statistics, Advertisement, Dublin Core / IndonesiaDLN Metadata, XML Based Transaction, CD-ROM Enabled
5
Library Automation
  Software Title Operating Systems Description
PhpMyLibrary Linux PhpMyLibrary is a PHP MySQL Library automation application. The program consist of cataloging, circulation, and the webpac module. The programs also has an import export feature. The program strictly follow the USMARC standard for adding materials.
KOHA   Claiming to be the first Free Open Source Library System
6
Miscellaneous
  Software Title Operating Systems Description
  Concordance Unix like Sytems Concordance is a program that scans a text file and outputs concordance lines based on a node entered by the user. There are two web-based versions of Concordance: one in PERL and the other in C.

  Rakim (Rakim A Knowledge Instant Messanger) Unix like Systems rakim is an attempt at providing a web-based reference service. It is very similar to standard chat room scripts, but with some added features that might be useful when trying to help patrons find answers to their questions.
  WIBS Unix like systems

WIBS stands for Windsor Internet Booking System. It is based on the wonderful MRBS (Meeting Room Booking System) and is designed to mediate access to public stations in libraries. WIBS features include:
- uses your browser for logon and time-control
-is platform-independent
-allows patrons to book stations on the web

  Reference Desk Manager (RDM) Unix-like System RDM features are:

Email weblog -- with search feature
Electronic Card File -- with search feature
Common Links Area
Web-based Administration

  ROADS Unix-like System ROADS is a set of software tools to enable the set up and maintenance of Web based subject gateways.