HC1041 IT for Business
Individual Assignment Worth 30 marks due Friday Week 10
1500 words
Scenario
You have been asked to explore three database alternatives for a merchant who provides shoes to retail stores. The merchant needs the database to keep the financial records, employee records, inventory records and general use records for the business. Answer the questions below in your report.
1. Identify three database systems and discuss the purpose for choosing these three options
2.Pick only ONE of the above database systems and discuss the Hardware and software components required to run it
3. Discuss the costs involved in implementing the database
4. Discuss other factors which the business should consider before implementing the database
5. Provide sufficient citations throughout and matching
Individual Assignment
Create a report. 1500 words. Look at Week 08 lecture on organising data for more assistance. Justify all assumptions.
SafeAssign
Using the SafeAssign link in Blackboard to submit your report. Download and print out the FULL report and attach to the appendix at the back of your report. Assignments without the full SafeAssign report will NOT be marked.
Hard Copy
To be placed into assignment drop area on campus, including SafeAssign report, by 5pm Friday of week 10.
References/citations
You are expected to provide citations for every piece of information you include in your report. Any pictures used, other than ones you created, should be cited. Any information on computer systems, hardware etc, should be cited. You will lose marks for not providing citations.
Chapter 7
Organising Data and Information
2
3
Principles
The database approach to data management provides significant advantages over the traditional file-based approach.
A well-designed and well-managed database is an extremely valuable tool in supporting decision making.
The number and types of database applications will continue to evolve and yield real business benefits.
3
4
Learning Objectives
LO1: Define general data management concepts and terms, highlighting the advantages of the database approach to data management.
LO2: Describe the relational database model and outline its basic features.
4
5
Learning Objectives
LO3: Identify the common functions performed by all database management systems and identify popular end-user database management systems.
LO4: Identify and briefly discuss current database applications.
5
6
Data Management
Data consists of raw facts, such as employee numbers or sales figures.
For data to become useful information, it must first be organised in a meaningful way.
6
7
The Hierarchy of Data
A bit (a binary digit) represents a circuit that is either on or off.
Bits are organised into units called bytes.
A byte is typically 8 bits.
Character: Each byte represents a character, the basic building block of information.
Field: Typically a name, number, or combination of characters that describes an aspect of a business object or activity.
7
8
The Hierarchy of Data
Record: A collection of related data fields.
File: A collection of related records.
Database: A collection of integrated and related files.
Hierarchy of data: Formed by bits, characters, fields, records, files, and databases.
8
9
Data Entities, Attributes, and Keys
Entity: A generalised class of people, places, or things (objects) for which data is collected, stored, and maintained.
Attribute: A characteristic of an entity.
Data item: The specific value of an attribute.
Key: A field or set of fields in a record that is used to identify the record.
Primary key: A field or set of fields that uniquely identifies the record.
9
10
The Traditional Approach
Traditional approach: Separate data files are created and stored for each application program.
Results in data redundancy: duplication of data in separate files
Data redundancy conflicts with data integrity (the degree to which the data in any one file is accurate)
10
11
The Database Approach
Database approach: Approach whereby a pool of related data is shared by multiple application programs; offers significant advantages over traditional file-based approach.
Controls data redundancy
More efficient with increased data integrity
11
12
Advantages of the Database Approach
Improved strategic use of corporate data
Reduced data redundancy
Improved data integrity
Easier modification and upgrading
Data and program independence
Better access to data and information
Standardisation of data access
A framework for program development
Better overall protection of the data
Shared data and information resources
12
13
Disadvantages of the Database Approach
More complexity
More difficult to recover from a failure
More expensive
13
14
Data Modelling and the Relational Database Model
When building a database, an organisation must consider:
Content: What data should be collected and at what cost?
Access: What data should be provided to which users and when?
Logical structure: How should data be arranged so that it makes sense to a given user?
Physical organisation: Where should data be physically located?
14
15
Data Modelling
Building a database requires two types of designs:
Logical design: Shows an abstract model of how the data should be structured and arranged to meet an organisation’s information needs
Physical design: Starts from the logical database design and fine-tunes it for performance and cost considerations
15
16
Data Modelling
Data model: A diagram of data entities and their relationships.
Entity-relationship (ER) diagrams: Data models that use basic graphical symbols to show the organisation of and relationships between data.
16
17
The Relational Database Model
Relational model: Describes data in which all data elements are placed in two-dimensional tables, called relations, that are the logical equivalent of files.
In the relational model:
Each row of a table represents a data entity
Columns of the table represent attributes
Domain: The allowable values for data attributes.
17
18
Manipulating Data
Selecting: Eliminates rows according to certain criteria.
Projecting: Eliminates columns in a table.
Joining: Combines two or more tables.
Linking: Combines two or more tables using common data attributes to form a new table with only the unique data attributes.
18
19
Database Management Systems (DBMS)
A group of programs used as an interface between a database and application programs or a database and user.
Database types:
Flat file
Single user
Multiple users
19
20
Database Management Systems (DBMS)
Schema: A description of the entire database.
User view: The portion of the database a user can access. Subschemas are used to create different user views.
Subschema: A file that contains a description of a subset of the database and identifies which users can view and modify the data items in the subset.
20
21
Database Management Systems (DBMS)
Fig 7.10: The Use of Schemas and Subschemas
21
22
Creating and Modifying the Database
Data definition language (DDL):
A collection of instructions and commands used to define and describe data and data relationships in a specific database.
Allows the database’s creator to describe the data and the data relationships that are to be contained in the schema and subschemas.
Data dictionary: A detailed description of all the data used in the database.
22
23
Storing and Retrieving Data
When an application program request data from DBMS, the application program follows a logical access path.
When the DBMS goes to a storage device to retrieve the requested data, it follows a path to the physical location (physical access path) where the data is stored.
23
24
Manipulating Data and
Generating Reports
Data manipulation language (DML): The commands that are used to manipulate the data in a database.
Structured Query Language (SQL): Adopted by the American National Standards Institute (ANSI) as the standard query language for relational databases.
Once a database has been set up and loaded with data, it can produce reports, documents, and other outputs.
24
25
Database Administration
Responsibilities include:
Designing, implementing, and maintaining the database system and the DBMS
Establishing policies and procedures regarding management, security, maintenance and usage
Training employees in database management and use
25
26
Popular Database Management Systems
Popular DBMSs for end users include Microsoft’s Access and Corel’s Paradox.
The complete database management software market includes databases by IBM, Oracle, and Microsoft.
Examples of open-source database systems: PostgreSQL and MySQL.
Many traditional database programs are now available on open-source operating systems.
26
27
Special-purpose Database Systems
Used for specific purposes or industries.
Examples:
Summation or Concordance, used in law firms for legal documents
Scottish Intelligence Database, used by Scottish police to share crime reports
GlobalSpec, used for engineers and product designers
27
28
Selecting a Database Management System
Important characteristics of databases to consider:
Size of the database
Number of concurrent users
Performance
The ability of the DBMS to be integrated with other systems
Features of the DBMS
Vendor considerations
Cost of the system
28
29
Using Databases with Other Software
DBMS can act as front-end or back-end application.
Front-end application: Directly interacts with people or users.
Back-end application: Interacts with other programs or applications
29
30
Linking the Company Database to the Internet
Corporate databases can be accessed by customers, suppliers, and company employees through:
The Internet
Intranets
Extranets
Semantic Web: A seamless integration of traditional databases with the Internet
30
31
Data Warehouses, Data Marts, and Data Mining
Data warehouse: A database that collects business information from many sources in the enterprise, covering all aspects of the company’s processes, products, and customers.
Data mart: A subset of a data warehouse.
Data mining: An information-analysis tool that involves the automated discovery of patterns and relationships in a data warehouse.
31
32
Business Intelligence
Business intelligence (BI): The process of gathering enough of the right information in a timely manner and usable form and analysing it to have a positive impact on business strategy, tactics, or operations.
Knowledge management: The process of capturing a company’s collective expertise wherever it resides and distributing it wherever it can help produce the biggest payoff.
32
33
Distributed Databases
Distributed database:
A database in which the data may be spread across several smaller databases connected via telecommunications devices
Corporations get more flexibility in how databases are organised and used
Replicated database: A database that holds a duplicate set of frequently used data.
33
34
Online Analytical Processing (OLAP)
Programs used to store and deliver data warehouse information efficiently.
Supports data analysis and decision making
Top-down, query-driven data analysis
User must be very knowledgeable of the data and its business context
34
35
Object-Oriented and Object-Relational Database Management Systems
Object-oriented database:
Stores both data and its processing instructions
Method: A procedure or action
Message: A request to execute or run a method
35
36
Object-oriented database management system (OODBMS): Group of programs that manipulate an object-oriented database and provide a user interface and connections to other application programs.
Object-relational database management system (ORDBMS): DBMS capable of manipulating audio, video, and graphical data.
Object-Oriented and Object-Relational Database Management Systems
37
Visual, Audio, and Other Database Systems
Companies have an increasing need to store large amounts of visual and audio signals.
Credit card companies
Hospitals
Music companies, etc.
Virtual database systems: Allow different databases to work together as a unified database system.
37
38
Summary
Hierarchy of data: Bits, characters, fields, records, files, and databases.
Entity: A generalised class of things (objects) for which data is collected, stored, and maintained.
Attribute: Characteristic of an entity.
Data model: Diagram of entities and relationships.
Relational model: Describes data in which all elements are placed in two-dimensional tables called relations.
38
39
Summary
Selecting: Eliminates rows according to certain criteria.
Projecting: Eliminates columns in a table.
Database management system (DBMS): Group of programs used as an interface.
Between a database and application programs
Between a database and the user
Data dictionary: Detailed description of all the data used in the database.
39
40
Summary
Data warehouse: Database that collects business information from all aspects of a company’s processes, products, and customers.
Data mining: An information-analysis tool for the automated discovery of patterns and relationships in a data warehouse.
Open database connectivity (ODBC) standards: Ensure that software can be used with any ODBC-compliant database.
40
Week 08 Review Questions and Solutions (on Organising Data)
1. What is an attribute? How is it related to an entity?
2. Define the term database. How is it different from a database management system?
3. What is a flat file?
4. How would you describe the traditional approach to data management? How does it differ from the database approach?
5. What is data modelling? What is its purpose? Briefly describe three commonly used data models.
6. What is a database schema, and what is its purpose?
7. Identify important characteristics in selecting a database management system.
8. What is the difference between a data definition language (DDL) and a data manipulation language (DML)?
9. What is a distributed database system?
10. What is a data warehouse, and how is it different from a traditional database used to support OLTP?
11. What is OLAP?
12. What is an ORDBMS? What kind of data can it handle?
13. Give an example of a visual database.
1. An entity is a generalised class of people, place, or thing (object) for which data is collected, stored, and maintained. An attribute is a characteristic of an entity. For example, employee number, last name, first name, hire date, and department number are attributes for an employee.
2. Databases are collections of integrated and related files. The DBMS is the software used to manipulate the database and provide an interface between the database and the user or application programs. A database management system is systems software that helps organise data for effective access and storage by multiple applications. A DBMS provides different users different views of the data (sub-schemas), avoids redundancy, encourages program independence, offers flexible access, and provides centralised control.
3. A flat file is a simple database program that has no relationship between its records and is often used to store and manipulate a single table or file.
4. In a database approach, a pool of related data is shared by multiple application programs. Rather than having separate data files, each application uses a collection of data that is either joined or related in the database.
5. A data model is a diagram of entities and their relationships. Data modelling usually involves understanding a specific business problem and analysing the data and information needed to deliver a solution. Data modelling allows database designers to show the logical relationships among data. The following table provides a summary of the different data models:
Database Model Characteristics
Hierarchical Data is organised in a top-down or inverted tree structure
Network An extension of the hierarchical model in which a member may
have many owners
Relational A model that organises data in tabular format with rows and columns
6. A database schema is a description of the entire database. It can be part of the database or a separate file. The DBMS can reference a schema to find where to access a particular piece of data in relation to another.
7. The important characteristics that should influence the selection of a database management system include: 1) database size – how large does the database need to be?; 2) number of concurrent users – how many users will access the database at any one time?; 3) performance issues – how fast does the database need to perform various operations?; 4) integration – does the database need to interact with other systems?; 5) features – what wizards, security, documentation systems and other pertinent tools come with the database?; 6) vendor – does the software supplier have a reliable history and a current stability?; and 7) cost – how much will the system cost in terms of purchase and operation?
8. The data definition language is a collection of instructions and commands which enables the database’s creator to describe the data and data relationships that are to be contained in schemas and sub-schemas (user views of the data). A data manipulation language includes the set of commands used to manipulate the data in a database. SQL is an example.
9. A distributed database system is one in which the physical data may be spread across several smaller databases connected via networks or telecommunication devices. Although in separate locations, the data may appear in a single unified schema.
10. A data warehouse is a relational database management system designed to specifically support management decision making. An online transaction processing (OLTP) system collects business data as it is generated. This data, while efficiently collected, is not ready for meaningful analysis. Instead, it must be rendered into a steady state before it can be used for decision making. The data in a warehouse is derived from the online transaction processing system data but only after it has been ‘cleaned up’. The data warehouse then manages the flow of this data to end user applications.
11. OLAP tools can tell users what happened in their business. Online analytical processing (OLAP) programs are used to store and deliver data warehouse information. The OLAP allows users to explore corporate data in new and innovative ways using multiple dimensions such as products, salespeople, or time. OLAP programs include spreadsheets, reporting and analysis tools, and custom applications.
12. An object-relational database management system (ORDBMS) is a DBMS capable of manipulating audio, video, and graphical data.
13. A visual database can be stored in some object-relational databases or special purpose database systems. Purdue University has developed an audio database and processing software to give singers a voice makeover. The database software can correct pitch errors and modify voice patterns to introduce vibrato and other voice characteristics. Drug companies often need the ability to analyse a large number of visual images from laboratories. The PetroView database and analysis tool allows petroleum engineers to analyse geographic information to help them determine where to drill for oil and gas.