Glossary of Data Management & Technical Terms


Fall 2000


This article focuses is on the language of data management. What is a "primary key"? What does "machine language" mean?  For the definition to these and scores of other data management and technical terms, keep on reading.


Aggregate Data. Data that summarizes or combines other data for analytical and research purposes.

Administrator. Someone with special network privileges allowing the setup and control all or part of a computer networks system.

Alert. Notification that an event has exceeded a predetermined threshold.

Algorithm. A defined sequence of actions taken to accomplish a task or solve a problem. See fuzzy logic.

ASCII. American Standard Code for Information Interchange. Binary codes representing alphabetic and numeric information on a computer. It has 128 codes for upper and lower-case letters, numbers, punctuation, and special control characters.

Authorization Rules. Criteria used to determine whether an individual or group may access data, software, and other services on a computer network.

Bar Coding. An identification technology that encodes data into groups of variously patterned bars and spaces and sometimes numbers. These are scanned and read into computer memory.

BASIC. Beginner's All-Purpose Symbolic Instruction Code. A simplified computer programming language widely used by beginners and non-professionals.

Benchmark. A fixed point of reference used as a standard for comparison. It is an outstanding example appropriate for use as a model. Also, a standardized test used as a basis to evaluate and compare the performance of computers and peripheral devices and components.

Binary Code. A representation of information using a sequence of zeros and ones that is the basis for calculations for all computers.

Bit. A digit in binary notation equivalent to the result of a choice between two alternatives such as one or zero, yes or no, etc.

Boolean Logic. A process of reasoning using symbolic relationships designed to permit mathematical calculations. It incorporate sets, propositions, and on-off circuit elements such as AND, OR and NOT. It is also called Boolean mathematics and Boolean algebra.

Browser. A software application used to access internet sites and read HTML documents in a user-friendly format. The most common are Microsoft Internet Explorer and Netscape Navigator. See home page, WWW.

Buffer. An area of computer memory that temporarily holds data being transmitted between a peripheral device and components and the central computer processor to allow for differences in work speeds.

Bus. A high-speed passageway shared by signals from the components of a computer.

Byte. A group of eight binary digits processed as a unit by a computer. It is generally the amount of computer memory needed to hold an alphabetic character.

C, C+, C++. High-level computer programming languages designed to produce a efficient translation of a program into machine language. These are widely used by professionals.

Cache Memory. High-speed, buffer-type computer memory used for storage of frequently used instructions or data.

CAD. Computer-Aided Design. Two- and three-dimensional design using computer software. It is used to make models of physical objects such as buildings and mechanical parts.

CD-ROM. Compact Disc. A portable disc utilizing Read-only Memory. It stores data that can be read by a computer.

Checksum. An computerized value at the end of a block of data corresponding to the sum of all information in the block. It is used to detect errors and data corruption.

Client. A computer workstation on a network. The client provides a user interface and processor to work with software, data, and other services stored on a server. See client application, client/server.

Client Application. A software application on client computers on a network used to extract or download software applications, data, and other services stored on a server.

Client/Server. A common network distributed technology approach where software is split between server tasks, providing shared functions and client tasks, providing individual user functions. A client sends requests to a server and the server responds.

Coaxial Cable. Insulated transmission line that offers a large bandwidth and the ability to support high data-transfer rates. It is used to transmit high-frequency telegraph, telephone, and television signals.

COBOL. Common Business Oriented Language. A high-level computer programming language commonly used for data processing and business applications. It is considered closer to English than many other high-level languages.

Column. A vertical string of fields beginning on the top of a spreadsheet and continuing to bottom. Compare field, record, row.

Compatibility. The ability for two computer devices or components to work with each other. Also, the ability of software applications to run on a particular hardware platform.

Compiler. A program that translates a high-level language program into machine language that can be understood by a computer.

Configuration. A particular setup, including options chosen and the arrangement of peripheral devices and components, for computer hardware and software applications.

Control Data. Data that guides and controls a process, such as flags, counters, and other indicators.

Coprocessor. An extra processor in a computer designed to perform specialized tasks more efficiently than the CPU. They are commonly used to speed up mathematical calculations.

CPU. Central Processing Unit. The main part of a computer. It controls the operation of the entire computer system and executes the arithmetical and logical functions contained in software program. Compare coprocessor. See motherboard, multiprocessing.

Crosstab. A simplified form of data analysis that combines and summarizes information for examination and reporting. They are commonly used with spreadsheet applications.

Data. Factual information in the form of text, numbers, graphics, etc. It is used as a basis for reasoning, discussion, and calculations. Also, numeric and other information in a format suitable for processing by a computer. See aggregate data, checksum, control data, data collecting, data management, data management software, data mining, data transfer, database, derived data.

Data Collecting. Bringing data from remote sites to a central location and organizing it into useful information.

Data Dictionary. A database about the structure of a database, including table names, structure, and information about their use. A Data Dictionary is commonly used to control user input during data entry.

Data Management. The work of controlling, processing, protecting, and facilitating access to data to provide information to those who need it. See data management software, information needs analysis.

Data Management Software. Software that facilitates controlling, processing, protecting, and facilitating access to data. It converts data into format that makes it easy manage by providing a user interface and specialized tools. See data management.

Data Mining. A technique using computer software designed to make it easy to search large amounts of data. It looks for general patterns or trends to filter the data based on search criteria.

Data Transfer. The process of moving data from one software application or OS to another.

Data Warehouse. An informational database used to store sharable operational information. It is commonly used to facilitate forecasting and planning. See DWA, filter, information warehouse.

Database. A collection of structured data arranged and organized for rapid search and retrieval using a computer. See data dictionary, data warehouse, DBA, DRDA, field, filter, index, record, relationship, schema, select, SQL.

DBA. Database Administrator. Controls, processes, protects, and facilitates access to databases.

Derived Data. Data resulting from computation or algorithms.

Desktop Application. Computer software that operates on a PC or workstation.

Deviation. The difference between what occurs and a fixed value.

Device Driver. Software that allows communication between a peripheral device or component and a computer.

Disk. A sometimes portable memory device coated with a magnetically sensitive substance on which data for a computer is stored. It is usually round, flat and rotates while in use. Data is usually stored and retrieved by heads mounted on movable or fixed arms.

Display. The representation of data and graphics on a computer video screen.

Double-click. Two closely spaced single clicks of a mouse button. It is commonly used to select or open a folder or file on a computer.

Document. Work created with computer software, such as by a word processor, and saved as a single unit. See edit, SGML.

Downtime. Periods of time when computer equipment is not available for use.

Drag/Drop. Placing a mouse pointer over an object on a computer video screen and holding down the mouse button to move the object to another location. When the object arrives at the new location, it is “dropped” into place.

DRDA. Distributed Relational Database Architecture. A database access standard developed by IBM.

Drill Down. A method of surveying detailed data that is used in a more general category of data.

DWA. Data Warehouse Administrator.

Edit. To modify an existing computer document.

E-mail. Electronic mail. A system to transmit and receive messages and files digitally over communication links, particularly the internet. Also, a message resulting from such a system. See server.

Ethernet. A standard for LAN’s developed jointly by Digital Equipment Corporation, Xerox, and Intel.

Fiber Optics. A data transmission medium that uses light conducted through thin transparent fibers of glass or plastic.

Field. A segment of a database table or spreadsheet containing a specific portion and kind of information. It can be one rectangle or an entire vertical column of a database table or spreadsheet. Compare column, foreign key, primary key, record, row.

File. Related data stored as a unit with a single name in a form that can be read by a computer.

File Import/Export. To bring data into (import) and send data out (export) of the data management system. See utility program.

Filter. Selected criteria that specify a subset of information in a spreadsheet, database, or data warehouse.

Firmware. Computer programs contained permanently in a hardware device in ROM rather than implemented through software.

Flowchart. Diagrammatic representation showing step-by-step progress in an operation or procedure.

Form. A computerized equivalent of paper forms. It allows standardized data entry into a data management systems.

Foreign Key. A field or combination of fields in a database table with a value matching the primary key in another table. See relationship.

FAQ. Frequently Asked Questions. A part of a website that contains answers to frequently asked questions.

Frequency. The number of cycles-per-second at which an analog signal occurs. It is usually expressed in hertz (Hz).

Front End. The user interface of a custom data management program. It the part of the program that allows the user to easily access and use data and other services.

FTP. File Transfer Protocol. The most common way to transfer files from one computer to another over the internet. See protocol.

Function. An operation or event called for in a computer software instruction.

Fuzzy Logic Algorithms that consider a range of acceptable values. It is used to facilitate processing imprecise data.

Gateway. Software or hardware that translate between computers on a network using different communication protocols.

GIF. Graphics Interchange Format. (Pronounced jiff.) A file compression format commonly used for transmitting raster-based graphic files over the internet.

Gopher. A menu-based system for searching web resources. Items are arranged hierarchically in layers, and each is either a file or folder.

GUI. Graphical User Interface. (Pronounced gooey.) Computer software interface usually characterized by the use of windows, icons, menus, and graphic controls.

Hardware. The physical components of a computer system such as circuit boards, disk drives, chassis, mouse, keyboard, and printer. Compare software. See module, OEM.

High-level Language. Any of several problem-oriented computer programming language in which English-style instruction correspond to several machine language instructions. See C, C+, C++, COBOL, compiler, Pascal.

Home Page. The webpage displayed when starting up a web browser. Also, the main page on a website. See index.

Host Computer. The primary computer on a network. It often acts as or is a server.

Householding. An algorithm used to consolidate names and addresses by residence or location.

HTML. Hyper Text Markup Language. An easy programming language commonly used to develop hypertext documents for the internet. Such documents can incorporate text, graphic images, sound, video, and hyperlinks. Tags and other elements are used to tell browsers how to display information on a computer video screen. It is a subset of SGML. See browser, WWW.

HTTP. Hyper Text Transfer Protocol. A set of conventions used to request and transmit web pages and other documents over the internet or other computer network. See protocol. gin-top: 0

Hyperlink. A link between related web pages and other document. It allows you to easily jump from one webpage or other document to another. See hypertext.

Hypertext. A document that is a combination of text, graphic images, sound, video, and/or hyperlinks.

Image Processing. Techniques for processing graphic images and video by a computer.

Index. A special file that tracks where and how data is stored in a database table. It is commonly used to allow faster access to information in a large database. Also, the home page of a website.

Information. Data processed in such a way that it can increase the knowledge of those who use it. It is the goal and end result of a data management system. See data, data collecting, data management.

Information Needs Analysis. The identification of the data management requirements necessary to meet the needs of a business or organization.

Information Warehouse. An special brand of data warehousing developed by IBM that supports the implementation of either functional, central or decentralized warehousing.

Input. A signal or data that is fed to a computer or peripheral device or component for processing.

Integrated Circuit. A complex set of small interconnected semiconductor devices, such as transistors, capacitors, and resistors, printed on a single silicon chip. See microprocessor.

Interface. The part of a computer software program designed to allow users to easily interact with their computer. It commonly uses windows, input boxes, menus, icons, and controls. See  client, data management software, front end, GUI, ODBC.

Internet. An interconnected system of networks that links computers around the world. It uses a low-level protocol called TCP/IP. See browser, e-mail, FTP, GIF, HTML, HTTP, IP address, newsgroup, TIFF, URL, WWW.

IP Address. Internet Address. A unique code assigned as an identifier for those connected to the internet.

LAN. Local Area Network. (Pronounced lan.) A network of usually physically linked computers, printers, and other electronic equipment that spans a limited area such as a building or office. See ethernet.

LED. Light Emitting Diode. A solid-state semiconductor device that emits light when a voltage is applied to it. It is used in electronic displays such as on calculators and digital watches.

Machine Language. Binary instructions that can be read directly by a computer without translation. Also know as machine code. See C, C+, C++, compiler, high-level language, programming language.

Macro. A single computer instruction that results in a sequence of commands. They are commonly used to streamline frequently-used functions, particularly in spreadsheet and word processor software.

Mainframe. A very large, powerful computer that can handle multiple tasks concurrently. It is commonly used by an entire business or organization. See server, supercomputer.

Make-to-Order. Products or services custom designed to meet specific needs of an individual, business, or organization.

Memory. An electronic device or component that can store data for later retrieval. See bar coding, buffer, byte, cache memory, disk, RAM, ROM.

Methodology. A system of rules and procedures used by those engaged in a discipline or inquiry.

Microcomputer. A PC (personal computer). They are generally used to run commercial software programs. It’s main component is a Microprocessor. See motherboard, multiprocessing, RAM.

Microprocessor. The main component of a microcomputer. It is an integrated circuit containing the entire computer processor on a single chip.

Minicomputer. A mid-sized computer between a microcomputer and a mainframe. It is commonly used for a single applications.

MIPS. Millions of Instructions Per Second.

MIS. Management Information Systems.

Modem. A device or component for transmitting and receiving digital data over telephone wires.

Module. A distinct unit of hardware. Also, a small computer program or routine designed for specific tasks that can be used alone or as part of a software program.

Motherboard. The main circuit board of a microcomputer. It usually containing the circuitry for the CPU, keyboard, and monitor and often has slots for additional circuitry.

Multiprocessing. Two or more CPU’s in the same microcomputer. It allows for faster operation and the ability to handle more complex tasks.

Multitasking. Operating two or more computer tasks at the same time, such as two software programs.

Nesting. A program subroutine or set of data contained within a larger routine or set of data.

Netware. A widely used network OS.

Network. A system of interconnected computers and peripherals. See administrator, authorization rules, client, client application, client/server, gateway, host computer, internet, LAN, newsgroup, node, securability, server, system integration, TCP/IP, token ring, topology, twisted pair.

Newsgroup. A area on the internet or a computer network devoted to discussion of a particular topic.

Node. A terminal, device, or component on a computer network. See topology, twisted pair.

Noise. Extraneous or meaningless data that sometimes obscures data analysis.

Null. An absent, unknown, or not yet known value. It should not be confused with zero, which is a value.

Object. An item that can be selected and maneuvered such as an icon.

Object Oriented Programming. Programming based on objects, such as windows, input boxes, icons, menus, and graphic controls, that include data and procedures to manipulate the data.

OCR. Optical Character Recognition The electronic identification and digital encoding of printed or handwritten characters using an optical scanner and specialized software.

ODBC. Open Database Connectivity. A interface protocol for accessing a variety of data sources. It uses SQL and was developed by Microsoft.

OEM. Original Equipment Manufacturer. Provides computers and peripherals made with hardware and software from other manufacturers and developers.

OLE. Object Linking and Embedding. A mechanism developed by Microsoft that allows computer software programs to communicate with each other.

OS. Operating System. System programs that control the operation of a computer system and direct the processing of software programs. Examples include Windows, DOS, Netware, and Unix. See data transfer.

Operator Interface. See Interface.

Option. A feature of a computer hardware device or component or a software program that allows the selection between two or more alternatives. See configuration.

Output. The result of computer processing including both paper documents and electronic files.

Parallel. The simultaneous transmission of all the bits of a byte over separate wires. Compare serial.

Pascal. A high-level computer programming language designed for structured programming.

PC. Personal Computer. See desktop application, microcomputer, server.

Pick List. A list of items that a software user can select from.

Pixel. The smallest element with controllable color and brightness in a computer video display.

Plotter. An output device that converts computer data into graphs and pictures on paper or computer video screen.

Port. The point at which signals from peripheral devices or components connect to a computer. Also, to modify software or data for use on a different hardware or software platform.

Primary Key. A field or combination of fields in a database table that have a unique value. It is used to identify a record. See foreign key, relationship.

Program. Coded instructions that enable a computer or peripheral device or component to perform a sequence of operations. See C, C+, C++, module, multitasking, nesting, OLE, OS, software, source code.

Programming Language. An artificial language used to write instructions that can be translated into machine language and then executed by a computer. See BASIC, C, C+, C++, COBOL, Pascal, source code, SQL, syntax.

Protocol. An agreed upon set of conventions that allow communication between computer systems. They generally govern the treatment and formation of data. See FTP, gateway, HTTP, internet, ODBC, TCP/IP, Telnet, URL.

Query. A selection of data from a database table or related databases based on chosen criteria, such as what results from an SQL operation. See Query Tool, select.

Query Tools. Software that allows a user to easy select data based on chosen criteria. See Query.

RAM. Random Access Memory. Computer memory that can be read from and written to by the computer software. It is the main memory device for microcomputers.

RDBMS. Relational Database Management System.

Real Time. The time a computer requires to complete an operation.

Record. A collection of related data treated as a unit. It commonly makes up one horizontal row of information in a database table or spreadsheet. Compare column, field.

Redundancy. More than one copy of the same data.

Relationship. The common values shared by two or more database tables. This links records in different database tables and is an essential component in relational database development. See foreign key, primary key.

Remote. A computer or peripheral device or component that is a significant distance from a user, such as in a different room or building. See data collecting.

Restore. To brining data back into its original or former state. It is the opposite of archiving and deleting.

Robotics. The study and design of robots.

ROM. Read Only Memory. Computer memory that can be read from but not written to by the computer software. It allows very fast access to permanently stored data. See CD-ROM, firmware.

Row. A horizontal string of fields beginning on one side of a spreadsheet and continuing to the other. Compare column, field, record.

Scalability. The ability of a system to effectively handle more and less data and users.

Schema. A diagrammatic representation of databases and their relationships.

Securability. The ability of computer network system to provide differing levels of access to different individuals and groups.

Select. A SQL statement specifying criteria for querying data from one database table or from more than one related database tables.

Semiconductor. The basis for integrated circuits. They are solid crystalline substances, usually germanium or silicon. See LED.

Serial. A computer connection where all the Bits of a Byte are transmitted sequentially. Compare parallel.

Server. A computer on a network that provides central data storage, access to shared peripherals, e-mail routing, and other services. They range in size and capacity from a PC to a mainframe. See client, client application, client/server, host computer.

Single-click. One click of a mouse. It is commonly used to select or open a folder or file or operate a control on a computer. Compare double-click.

Software. A computer program and related documentation. It directs the operation of a computer and peripherals for specific purposes. Commercial programs are often called this. See data transfer, device driver, document, interface, module, multitasking, OCR, OEM, OLE, pick list, port, RAM, ROM, SQL query tool. Compare firmware, hardware, OS.

Source Code. A computer program as written in a programming language. It must be translated to be read by a computer.

Spreadsheet. A computer program that that displays data in a ledger layout. It is commonly used for simple accounting, bookkeeping, and list management. Also, a worksheet or workbook. See column, crosstab, field, filter, record.

SQL. Structured Query Language. Pronounced S. Q. L. An easy programming language for querying and manipulating data in a database table or a relational database system. See ODBC, select, SQL Query Tool.

SQL Query Tool. Software that allows a user to easily select and manipulate data using SQL.

SGML. Standard Generalized Markup Language. A markup language for defining the structure of a computer document. HTML is a subset of this. See tag.

Supercomputer. A very fast, high-performance mainframe computer. They are commonly used to solve complex mathematical and scientific problems.

Syntax. Rules in a computer programming languages that guide how instructions can be written.

System Integration. The ability of computers and peripheral devices and components to communicate and work together in a network.

Tag. A programming element in HTML, SGML, and other markup languages. They are used to establish how document content is displayed on a computer screen or on paper.

TCP/IP. A protocol that governs communication between computers on a network and on the internet.

Telnet. A telecommunications protocol that allows interaction with remote computers and peripheral devices and components.

Terminal. Any device used to communicate with a computer in a remote location. They are used for data entry and other processing and are commonly equipped with a keyboard and video display.

TIFF. Tagged Image File Format. A file compression format commonly used for transmitting raster-based graphic files over the internet.

Token Ring. An access procedure used on a computer network.

Topology. The physical layout of a computer network system, particularly the connection between nodes.

Touch Screen. A computer screen that can be interacted with directly by touching elements on the display.

Trackball. A ball mounted in a stationary housing unit and rotated to control the movement of a cursor on a computer video display. They are commonly used on laptop computers.

Twisted Pair. A common wire used to connect nodes on a computer network. It consists of two copper conductors, both covered with insulation. It gets it’s name because the two wires are twisted to ensure they have equal exposure to the signals transmitted through it.

UPS. A battery power supply that can run a computer and peripheral devises and components temporarily if electrical power is interrupted.

URL. Uniform Resource Locator. An internet address commonly consisting of a protocol scheme (HTTP, FTP, etc.), the host domain name, and a file path.

Utility Program. A computer program that performs common data management tasks, such as importing/exporting and sorting data.

VGA. Video Graphics Array. A relatively high-resolution computer video display. It was introduced by IBM in 1987 to follow the EGA.

Windows. A very common GUI that uses rectangular areas called “windows” on the computer screen to displays data, icons, menus, controls, and other information. A computer screen can display more than one window. Also, a GUI introduced by Microsoft in 1990 as an extension of DOS, and which later became a graphical OS. See interface, object oriented programming.

Workbook. A collection of one or more worksheets. See spreadsheet.

Worksheet. An individual page of a spreadsheet application. Compare workbook.

Workstation. A stand-alone computer or terminal and any directly connected peripheral devises and components. See client, desktop application.

WWW. World Wide Web. An area of the internet, based on HTML, that is designed for user-friendly access to information using browsers.