dbm/
misc/
old-docs/
This is release 1.5 of GNU dbm.  Better documentation will be written
soon.  For now, this file briefly describes the contents of this
release and how to use it.

The files are:

COPYING		- Copying information.
README		- This file.

bucket.c, extern.h, falloc.c, findkey.c, gdbm.proto, gdbmclose.c,
gdbmconst.h gdbmdefs.h, gdbmdelete.c, gdbmerrno.h, gdbmfetch.c,
gdbmopen.c, gdbmreorg.c, gdbmseq.c, gdbmstore.c, global.c, gndbm.h,
hash.c systems.h, update.c, version.c - Source for GNU dbm library.

dbm.h dbminit.c, delete.c, fetch.c, seq.c, store.c - Source for the
DBM interface.

dbmclose.c, dbmdelete.c, dbmdirfno.c, dbmfetch.c, dbmopen.c, dbmpagfno.c
dbmseq.c, dbmstore.c, ndbm.h - Source for the NDBM interface.

Makefile 	- Makefile, will make gdbm.a  (BSD and SYSV)
testgdbm.c	- A simple test program.
testdbm.c	- A simple test program.
testndbm.c	- A simple test program.
conv2gdbm.c	- A dbm database conversion program.

CHANGES from 1.4 to 1.5:
  1.  Minor bug fixes.  See the ChangeLog.
  2.  Added gdbmconst.h to allow users to change the size of the
      bucket cache in the systems.h file.

CHANGES from 1.0 to 1.4:
  1.  Mainly bug fixes.
  2.  A define for "dbmclose()" was added to dbm.h for those few 
      implementaions that need that call.
  3.  For details, see the ChangeLog.

CHANGES from 0.9 to 1.0:
  1.  Makefiles were combined into one and a few new things added to it.
  2.  Several minor bugs were fixed including a cache bug.
  3.  Two new calls (dbm_pagfno, dbm_dirfno) were added to the NDBM interface.
  3.  A conversion program from dbm files to gdbm files was added.
  4.  Reorganize was changed to allow complex file names. (dir/file form)
  5.  testgdbm, testndbm, and testdbm were modified to return key and data
      pairs where needed and to take an optional file name as an argument.
      testgdbm had some command characters changed.
  6.  The DBM and NDBM interfaces were separated.
  7.  An include file for dbm users was added. (dbm.h)
  8.  The include file for ndbm users was renamed ndbm.h. (It was gndbm.h.)

CHANGES from 0.8 to 0.9:
  1.  The hash function changed.
  2.  The file format changed.
  3.  There was a complete rewrite of falloc.c.
  4.  There were added compatiblity routines for ndbm.
  5.  The file names for dbm compatibility routines were made to
      look like dbm.
  6.  Test programs changed.
  7.  Support for System V.
  8.  Various other small changes.
  9.  The need for recovery and associated code was removed.


GNU dbm is a set of database routines that use extendible hashing and
works similar to the standard UNIX dbm routines.  The basic unit of
data is the structure

  typedef struct {
             char *dptr;
             int   dsize;
          } datum;

The following is a quick list of the routines.  After this list, a
longer description of each routine will be given.  An include file
will be produced that can be included by the user. The file is
"gdbm.h".  The following routines are defined in terms of gdbm.h:

  GDBM_FILE gdbm_open ( name, block_size, read_write, mode, fatal_func )

  void gdbm_close ( dbf )
  
  int gdbm_store ( dbf, key, content, flags )

  datum gdbm_fetch ( dbf, key )

  int gdbm_delete ( dbf, key )

  datum gdbm_firstkey ( dbf )

  datum gdbm_nextkey ( dbf, key )

  int gdbm_reorganize ( dbf )


For compatibility with the standard dbm, the following routines are 
defined.  There is an include file for dbm users called "dbm.h".

  int dbminit ( name )

  int store ( key, content )

  datum fetch ( key )

  int delete ( key )

  datum firstkey ()

  datum nextkey ( key )


There are also compatibility routines for ndbm.  For ndbm compatiblity
routines, you need the include file "ndbm.h".  The routines are:

DBM *dbm_open (name, flags, mode)

void dbm_close (file)

datum dbm_fetch (file, key)

int dbm_store (file, key, content, flags)

int dbm_delete (file, key)

datum dbm_firstkey (file)

datum dbm_nextkey (file)

int dbm_error (file)

int dbm_clearerr (file)

int dbm_pagfno (file)

int dbm_dirfno (file)


Description of GNU dbm routines.
--------------------------------

GNU dbm allows multiple data files.  A routine that opens a gdbm file
is designated as a "reader" or a "writer".  Only one writer may open a
gdbm file and many readers may open the file.  Readers and writers can
not open the gdbm file at the same time. The procedure for opening a
gdbm file is:

GDBM_FILE dbf;

dbf = gdbm_open ( name, block_size, read_write, mode, fatal_func )

The parameters are:
  char *name - the name of the file (the complete name, gdbm does
	  not append any characters to this name)
  int block_size - the size of a single transfer from disk to memory.
	  This parameter is ignored unless the file is a new file.
	  The minimum size is 512.  If it is less than 512, dbm will
	  use the stat block size for the file system.
  int read_write - 0 => reader, 1 => writer, 2 => writer (if the database
	  does not exist, create a new one, 3 => writer and create a new
	  database regardless if one exists.  (Defined in gdbm.h as
	  GDBM_READER, GDBM_WRITER, GDBM_WRCREAT, GDBM_NEWDB.)
  int mode - file mode (see chmod(2) and open(2)) if the file is created.
  void (*fatal_func) () - a function for dbm to call if it detects a
	  fatal error. The only parameter of this function is a string.
	  If the value of 0 is provided, gdbm will use a default function.

The return value, dbf, is the pointer needed by all other routines to
access that gdbm file.  If the return is the NULL pointer, gdbm_open
was not successful.  The errors can be found in "gdbm_errno" for gdbm
errors and in "errno" for file system errors.  (For error codes, see
gdbmerrno.h.)

In all of the following calls, the parameter "dbf" refers to the pointer
returned from gdbm_open.

It is important that every file opened is also closed.  This is needed to
update the reader/writer count on the file.  This is done by:

  gdbm_close(dbf);


The database is used by 3 primary routines.  The first stores data in the
database.

  ret = gdbm_store ( dbf, key, content, flag )

  The parameters are:
     char *dbf - the pointer returned by gdbm_open
     datum key - the key data
     datum content - the data to be associated with the key
     int   flag - 0 => insert only, generate an error if key exists.
	          1 => replace contents if key exists.  (Defined in
	          gdbm.h as GDBM_INSERT and GDBM_REPLACE.)

  If a reader calls store, ret gets -1.  If called with GDBM_INSERT and
  key is in the database, ret gets 1.  Otherwise, ret is 0.
  NOTICE: If you store data for a key that is already in the data base,
  gdbm replaces the old data with the new data if called with GDBM_REPLACE.
  You do not get two data items for the same key and you do not get an
  error from gdbm_store.
  NOTICE: The size in gdbm is not restricted like dbm or ndbm.  Your data
  can be as large as your "want".
  NOTICE: Both key and content must have the dptr field be a non-NULL
  value.  Since a NULL dptr field is used by other routines to indicate an
  error, a NULL field cannot be valid data.  If either key or content have
  a null dptr field, gdbm_open will return an error.

To search for some data:

  content = gdbm_fetch ( dbf, key )

  The parameters are:
     char *dbf - the pointer returned by gdbm_open
     datum key - the key data

  The "datum" returned in content is a pointer to the data found.  If the
  dptr is NULL, no data was found.  If dptr is not NULL, then it points
  to data allocated by malloc.  gdbm does not automatically free this data.
  The user must free this storage when done using it.  This eliminates the
  need to copy the result to save it for later use. (You just save the 
  pointer.)

To remove some data from the database:

  ret = gdbm_delete ( dbf, key )

  The parameters are:
     char *dbf - the pointer returned by gdbm_open
     datum key - the key data

  The ret value is -1 if the item is not present or the requester is a reader.
  The ret value is 0 if there was a successful delete.


The next two routines allow for accessing all items in the database.  This 
access is not key sequential, but it is guaranteed to visit every key in
the database once.  (The order has to do with the hash values.)

  key = gdbm_firstkey ( dbf )

  nextkey = gdbm_nextkey ( dbf, key )

  The parameters are:
     char *dbf - the pointer returned by gdbm_open
     datum key - the key data

  The return values are both datum.  If key.dptr or nextkey.dptr is NULL,
  there is no first key or next key.  Again notice that dptr points to
  data allocated by malloc and gdbm will not free it for you. 

The following routine should be used very seldom.  
  
  ret = gdbm_reorganize ( dbf )

  If you have had a lot of deletions and would like to shrink the space
  used by the gdbm file, the this routine will reorganize the database.
  gdbm will not shorten the length of a gdbm file except by using this
  reorganization.  (Deleted file space will be reused.)


The following two are variables that may need to be used:

  gdbm_error gdbm_errno   -   the variable that contains more information
                              about gdbm errors.  ( gdbm.h has the
			      definitions of the error values. )

  char * gdbm_version     -   the string containing the version information.


There are a few more things of interest.  First, gdbm files are not
"sparse".  You can copy them with the UNIX cp command and they will
not expand in the copying process.  Also, there is a compatibility
mode for use with programs that already use UNIX dbm.  In this
compatibility mode, no gdbm file pointer is required by the user.
Only one file may be opened at a time.  All users in compatibility
mode are assumed to be writers.  If the gdbm file is a read only, it
will fail as a writer, but will also try to open it as a reader.  All
returned pointers in datum structures point to data that gdbm WILL
free.  They should be treated as static pointers (as standard UNIX dbm
does).  The compatibility routine names are the same as the UNIX dbm
routine names.  Their definitions follow:

  int dbminit ( name )

  int store ( key, content )

  datum fetch ( key )

  int delete ( key )

  datum firstkey ()

  datum nextkey ( key )

NOTE: Some implementations have an include file "dbm.h".  That file is just
a file that defines datum and the above routines.  Many original dbm 
sites do not have a "dbm.h" file.  One is included here for those who 
want it.

WARNING: standard UNIX dbm and GNU dbm do not have the same data format in the
file.  You cannot access a standard UNIX dbm file with GNU dbm!  If you want
to use an old database with GNU dbm, you must use the convert program.

Also, GNU dbm has compatibility routines for ndbm.  For ndbm compatiblity
routines, you need the include file "ndbm.h".  

WARNING: If you have ndbm and gdbm, there is a conflict in the names of the
include file for the ndbm interface and the original ndbm package.  Do not
blindly copy "ndbm.h" to your include directory.

The routines are:

    DBM *dbm_open (name, flags, mode)

    void dbm_close (file)

    datum dbm_fetch (file, key)

    int dbm_store (file, key, content, flags)

    int dbm_delete (file, key)

    datum dbm_firstkey (file)

    datum dbm_nextkey (file)

    int dbm_error (file)

    int dbm_clearerr (file)

    int dbm_dirfno (file)

    int dbm_pagfno (file)

Again, just like ndbm, any returned datum can be assumed to be static
storage.  You do not have to free that memory, the ndbm compatibility
routines will do it for you.


Notes on making GNU dbm.
------------------------

The "Makefile" will make both "gdbm.a", the collection of gdbm routines and
three simple test programs that uses the gdbm routines.  Two test programs
test the dbm and the ndbm interface routines.  The third was use to help
debug gdbm and contains "inside knowledge."   The first two can be linked
using the original dbm and ndbm routines.  The makefile also makes the
conversion program, conv2gdbm.   The make commands are:

   make gdbm.a		makes the gdbm.a archive.
   make testgdbm	makes both gdbm.a and the gdbm test program.
   make testdbm 	makes both gdbm.a and the dbm test program with gdbm.
   make testndbm	makes both gdbm.a and the ndbm test program with gdbm.
   make allgdbm		makes all of the above.
   make tdbm		makes the dbm test program linked with dbm, not gdbm.
   make tndbm		makes the ndbm test program linked with ndbm, not gdbm.
   make alldbm		makes the two previous programs.
   make conv2gdbm	makes the conversion program.
   make all		makes all the above.
   make install		installs gdbm.a as /usr/lib/libgdbm.a and gdbm.h.


CONV2GDBM
---------

The program conv2gdbm has been provided to help you convert from dbm
databases to gdbm.  The usage is:

   conv2gdbm [-q] [-b block_size] dbm_file [gdbm_file]

The optional "block_size" is the same as in gdbm_open.  The dbm_file is
the name of the dbm file without the .pag or .dir extensions.  The
optional gdbm_file is the complete file name.  If not included, the
gdbm file name is the same as the dbm file name without any extensions.
That is "conv2gdbm dbmfile" converts the files "dbmfile.pag" and
"dbmfile.dir" into a gdbm file called "dbmfile".  The -q option causes
conv2gdbm to work quietly.


System V support  (And other systems.)
--------------------------------------

There is now support for System V.  This done via the systems.h file.
This is the place where all system dependencies should go.  The 
makefile should make gdbm with a few changes.  The makefile describes
the changes necessary for use with System V.  Also, read the makefile
and edit it if you use gcc.  There are several places where changes
are needed to use gcc.
  1)  uncomment: #CC=gcc
  2)  change the lines to make the test programs. (See the comments.)

If you port gdbm to another system, try to follow the change style used for
System V changes.  Please send your changes to phil@cs.wwu.edu if you would
like your changes included in a future release of gdbm.

Please send bug reports to bug-gnu-utils@prep.ai.mit.edu.

Thank you.