A digital library is a library in which a significant proportion of the resources are available in machine-readable format (as opposed to print or microform), accessible by means of computers. The digital content may be locally held or accessed remotely via computer networks. In libraries, the process of digitization began with the catalog, moved to periodical indexes and abstracting services, then to periodicals and large reference works, and finally to book publishing. Some of the largest digital libraries are purely digital, with few if any physical holdings.
The term "digital library" is broad enough to apply to a wide range of digital entities. A distinction can be drawn between libraries that have a physical presence, where patrons can access both physical and digital holdings, and libraries whose collections are almost completely digital. Project Gutenberg, ibiblio, the International Children's Digital Library, and the Internet Archive are examples of the latter case.
Traditional libraries are limited by storage space; digital libraries have the potential to store much more information, simply because digital information requires very little physical space to contain it. As such, the cost of maintaining a digital library is much lower than that of a traditional library. A traditional library must spend large sums of money on staff, book maintenance, rent, and additional books. Digital libraries largely do away with these costs.
Digital libraries can immediately adopt innovations in technology, providing users with improvements in electronic and audio book technology as well as new forms of communication such as wikis and blogs.
Critics note that digital libraries are hampered by copyright law, because works cannot be shared over different periods of time in the manner of a traditional library. In many cases the content is limited to public domain or self-generated works. Some digital libraries, such as Project Gutenberg, work to digitize out-of-copyright works and make them freely available to the public. An estimate has been made of the number of distinct books, dating from 2000 B.C. to 1960, that still exist in library catalogues [1].
Other digital libraries (more specifically, digital collections, which may be acquired by libraries) accommodate copyright concerns by licensing content and distributing it on a commercial basis, which allows for better management of the content's reproduction and the payment (if required) of royalties.
Digital libraries cannot reproduce the environment of a traditional library. Many people also find reading printed material easier than reading material on a computer screen, although this depends heavily on presentation as well as personal preference [2]. Also, because of technological developments, some of a digital library's content can become out of date and its data may become inaccessible.
Access to digital libraries and their collections depends on a stable information technology infrastructure (power, computers, communications links, etc.). Hence, despite the egalitarian potential of the digital library, many of those who could most benefit from its global reach (for instance in the Third World) are not able to do so.
Many academic libraries are actively involved in building institutional repositories of the institution's books, papers, theses, and other works that can be digitized. Many of these repositories are made available to the academic community or the general public. Institutional repositories are often referred to as digital libraries.
Archives differ from libraries in several ways. Traditionally, archives were defined as:
The technology used to create digital libraries has been even more revolutionary for archives, since it breaks down the second and third of these general rules. The use of search engines, optical character recognition, and metadata allows digital copies of individual items (e.g., letters) to be cataloged, and the ability to access digital copies remotely has removed the need to travel to a particular archive to find a particular set of records.
Cornell University and the Wisconsin Historical Society are considered leaders in the field of digital archive creation and management.
Most digital libraries provide a search interface which allows resources to be found. These resources are typically deep web (or invisible web) resources since they frequently cannot be located by search engine crawlers. Some digital libraries create special pages or sitemaps to allow search engines to find all their resources. Digital libraries frequently use the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to expose their metadata to other digital libraries, and search engines like Google can also use OAI-PMH to find these deep web resources.
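As a rough illustration of how metadata is exposed over OAI-PMH, the sketch below builds a `ListRecords` request URL and extracts identifiers and titles from a response. The verb, the `metadataPrefix` and `from` parameters, and the XML namespaces come from the OAI-PMH 2.0 and Dublin Core specifications; the base URL `http://library.example.org/oai`, the function names, and the sample response are assumptions made up for this example. No network request is performed.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical, hand-written response used instead of a live repository.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:library.example.org:1</identifier>
        <datestamp>2006-01-15</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A Sample Digitized Letter</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None):
    """Build a ListRecords request URL for an OAI-PMH repository."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date is not None:
        params["from"] = from_date  # restrict to records changed since this date
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (identifier, title) pairs found in a ListRecords response."""
    root = ET.fromstring(xml_text)
    results = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        title = record.findtext(".//dc:title", namespaces=NS)
        results.append((identifier, title))
    return results


if __name__ == "__main__":
    print(list_records_url("http://library.example.org/oai"))
    print(parse_records(SAMPLE_RESPONSE))
```

A real harvester would also need to follow resumption tokens for large result sets and handle error responses, both of which are omitted here.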
There are two general strategies for searching a federation of digital libraries: distributed searching, and searching over previously harvested metadata.
Distributed searching typically involves a client sending multiple search requests in parallel to a number of servers in the federation. The results are gathered, duplicates are eliminated or clustered, and the remaining items are sorted and presented back to the client. Scalability and performance issues tend to plague distributed searching for large federations of digital libraries. Protocols like Z39.50 are frequently used in distributed searching.
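The gather, deduplicate, and sort steps of distributed searching can be sketched as follows. This is a minimal illustration, not a Z39.50 client: the two "servers" are in-memory lists, `search_server` stands in for a network request, and all names and records are hypothetical. It also assumes that the same item carries the same identifier on every server, which real federations often cannot guarantee.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical federation: server name -> list of (identifier, title) records.
FEDERATION = {
    "library_a": [("rec-1", "History of Printing"), ("rec-2", "Maps of Wisconsin")],
    "library_b": [("rec-2", "Maps of Wisconsin"), ("rec-3", "Printing Press Letters")],
}


def search_server(server_name, query):
    """Stand-in for a remote search request sent to one server."""
    records = FEDERATION[server_name]
    return [r for r in records if query.lower() in r[1].lower()]


def distributed_search(query):
    """Query every server in parallel, then merge, deduplicate, and sort."""
    with ThreadPoolExecutor(max_workers=len(FEDERATION)) as pool:
        # One request per server, issued concurrently.
        result_lists = list(pool.map(lambda name: search_server(name, query), FEDERATION))
    merged = {}
    for hits in result_lists:
        for identifier, title in hits:
            merged.setdefault(identifier, title)  # keep the first copy of each identifier
    # Present the remaining items sorted by title.
    return sorted(merged.items(), key=lambda item: item[1])


if __name__ == "__main__":
    print(distributed_search("maps"))
```

Because the client waits for every server before merging, the slowest server sets the response time, which is one reason this approach scales poorly for large federations.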
Searching over previously harvested metadata requires the pooling of metadata collected from every digital library in the federation. This solution scales better than distributed search, but it introduces the problem of data freshness; digital libraries need to be re-harvested on a periodic basis to discover new and updated resources. OAI-PMH is frequently used by digital libraries for harvesting metadata.
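The trade-off between local search and data freshness can be seen in a small sketch. The `HarvestedIndex` class, the `fetch_since` function, and the `REMOTE` records below are all hypothetical; `fetch_since` stands in for an incremental harvest request (for example, an OAI-PMH request restricted by date) and simply filters an in-memory list.

```python
from datetime import date

# Hypothetical remote library: (identifier, title, date last modified).
REMOTE = [
    ("rec-1", "History of Printing", date(2006, 1, 10)),
]


def fetch_since(since):
    """Stand-in for an incremental harvest: records changed on or after `since`."""
    return [(i, t) for i, t, d in REMOTE if since is None or d >= since]


class HarvestedIndex:
    """Pooled metadata from several libraries, searched locally."""

    def __init__(self):
        self.records = {}       # identifier -> title
        self.last_harvest = {}  # library name -> date of the last harvest

    def harvest(self, library, fetch, today):
        """Pull records changed since the previous harvest and merge them in."""
        since = self.last_harvest.get(library)  # None triggers a full harvest
        for identifier, title in fetch(since):
            self.records[identifier] = title
        self.last_harvest[library] = today

    def search(self, query):
        """Search only the pooled copy; no remote request is made."""
        q = query.lower()
        return sorted(i for i, t in self.records.items() if q in t.lower())


if __name__ == "__main__":
    index = HarvestedIndex()
    index.harvest("library_a", fetch_since, date(2006, 2, 1))
    # A record added remotely after the harvest is invisible until re-harvest.
    REMOTE.append(("rec-2", "Maps of Wisconsin", date(2006, 2, 20)))
    print(index.search("maps"))
    index.harvest("library_a", fetch_since, date(2006, 3, 1))
    print(index.search("maps"))
```

Searches touch only the local pool, so query cost does not grow with the number of remote servers, but results are only as current as the most recent harvest.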
This section describes the framework of a digital library in order to illustrate its internal structure, using the Greenstone Digital Library Software as an example. Greenstone is free and open-source software for building digital libraries.
Large-scale digitization projects are underway at Google, the Million Book Project, MSN, and Yahoo!. With continued improvements in book handling and presentation technologies such as optical character recognition and ebooks, and the development of alternative depositories and business models, digital libraries are rapidly growing in popularity, as demonstrated by the efforts of Google, Yahoo!, and MSN. Just as libraries have ventured into audio and video collections, so have digital libraries such as the Internet Archive.