Data Management Foundations- C175

studied byStudied by 4 people
4.0(1)
get a hint
hint

Flat Files

1 / 122

Tags and Description

123 Terms

1

Flat Files

a file having no internal hierarchy

New cards
2

Hashed Files

A file that has been encrypted for security purposes.

New cards
3

Heap File

An unsorted set of records.

New cards
4

Information

The transformation of raw data into useful facts.

New cards
5

Punch Card

A card that is perforated and can hold commands or data.

New cards
6

Structured Data

Information with a high degree of organization.

New cards
7

Unstructured Data

Information that does not have structure (such as text)

New cards
8

Binary Relationship

A relationship between two entity types.

New cards
9

Unary Relationship

An associate occurrence of an entity type with other occurrences of the same entity type.

New cards
10

Cardinality

The maximum number of entities that can be involved in a particular relationship.

New cards
11

E-R Model

*E-R = Entity - Relationship

Diagram of entities together with their attributes and the relationship among them.

New cards
12

Intersection Data

It is data that describes a many-to-many relationship.

New cards
13

Modality

It is a minimum number of entity occurrences that can be involved in a relationship.

New cards
14

One-to-one Binary Relationship

It means that a single occurrence of one entity type can be associated with a single occurrence of the other entity type and vice versa.

New cards
15

Ternary Relationship

Involves three different entity types.

New cards
16

Unique identifier

It is used to uniquely identify each record in a database table.

New cards
17

Attribute

A property, characteristic, or fact that we know about an entity.

New cards
18

"A salesperson works in one office."

What is the name of this relationship?

One-to-one binary relationship

New cards
19

"A salesperson sells to many customers."

What is the name of this relationship?

One-to-many binary relationship

New cards
20

"A salesperson is authorized to sell many products, and a product can be sold by many salespersons."

What is the name of this relationship?

Many-to-many binary relationship

New cards
21

What is the positioning and meaning for Cardinality and Modality on an E-R model?

Cardinality is the outer symbol; represents the maximum.

Modality is the inner symbol; represents the minimum.

New cards
22

"A salesperson works in a minimum of one and a maximum of one office, and an office may be occupied by or assigned to a minimum of zero and a maximum of one salesperson."

New cards
23

"A salesperson may have no customers or many customers."

New cards
24

Describe the ER model for "Each salesperson is authorized to sell to at least one or many products, and each product can be sold by at least one or many salespeople."

New cards
25

"One salesperson backs-up another salesperson."

What is the name of this model?

One-to-one unary relationship

New cards
26

"A salesperson manages zero to many other salespersons, and a salesperson is managed by exactly one other salesperson."

What is the name of this model?

One-to-many unary relationship

New cards
27

"A product can either be part of no other products or be part of several other products, and a product can either be composed of no other products or be composed of several other products."

What is the name of this model?

Many-to-many unary relationship

New cards
28

What does 'refer' in Referential Integrity imply?

This revolves around the circumstance of trying to refer to data in one relation in the database, based on values in another relation.

New cards
29

Define the delete rule RESTRICT.

If the delete rule between two relations is RESTRICT and an attempt is made to delete a record on the "one side" of the one-to-many relationship, the system will forbid the delete to take place if there are any matching foreign key values in the relation on the "many side".

New cards
30

Define the delete rule CASCADE.

If the delete rule between two relations is CASCADE and an attempt is made to delete a record on the "one side" of the relationship, not only will the record be deleted but all of the records on the "many side" of the relationship that have a matching foreign key value will also be deleted.

In other words, the delete will "cascade" from one relation to the other.

New cards
31

Define the delete rule SET-TO-NULL.

If the delete rule between the two relations is SET-TO-NULL and an attempt is made to delete a record on the "one side" of the one-to-many relationship, that record will be deleted and the matching foreign key values in the records on the "many side" of the relationship will be set to null.

Essentially, it's exactly like the CASCADE delete option, but instead of completely deleting all possible values, the values are set to NULL instead.

New cards
32

Which entity is uniquely identified by concatenating the primary keys of the two entities it connects?

Associative Entity

New cards
33

Which type of entity is also called a dependent entity?

Weak Entity

New cards
34

Candidate Key

This is when a relation has more than one attribute or minimum group of attributes that represents a way of uniquely identifying the entity.

New cards
35

Concurrency Problem

When two or more users are trying to update the same record simultaneously.

New cards
36

Equijoin

Combines two or more tables based on a column that is common to the tables.

Example: Joining Client and Salesman tables that both contain the SalesmanID column which have the exact same values.

New cards
37

Foreign Key

When an attribute or group of attributes serves as the primary key of one relation and also appears in another relation.

New cards
38

Natural Join

Matches each row in a table against each row in another table based on common values found in columns sharing a common name and data type.

New cards
39

Tuple

Rows/records are referred to as tuples when talking about relations. They serve the exact same function, it just has a different name in the context of relations.

New cards
40

What are the five basic principles of The Database Concept?

  1. The creation of a datacentric environment that is a significant company resource, which can be shared inside and outside the company.

  2. The ability to achieve data integration while storing data in a non-redundant fashion.

  3. The ability to store data representing entities involved in multiple relationships w/o introducing data redundancy.

  4. Managing data control issues such as data security, backup and recovery, and concurrency control.

  5. High degree of data independence.

New cards
41

What are the four major DBMS approaches?

  • Hierarchical

  • Network

  • Relational

  • Object-oriented

New cards
42

What are four key differences between a RELATION and a FILE?

  • The columns of a relation can be arranged in any order w/o affecting the meaning of the data. That is not true of a file.

  • Similarly, the rows of a relation can be arranged in any order, which is not true of a file.

  • Every row/column position, sometimes referred to as a "cell", can have only a single value, which is not necessarily true in a file.

  • No two rows of a relation are identical, which is not necessarily true in a file.

New cards
43
  • in the SELECT clause

  • It indicates that all attributes of the selected row are to be retrieved

New cards
44

AND operator

  • It displays a record if more than one condition is true

New cards
45

AVG() function

  • It returns the average value of a numeric column.

New cards
46

BETWEEN operator

  • It allows you to specify a range of numeric values in a search.

New cards
47

DISTINCT operator

  • It is used to eliminate duplicate rows in a query result.

New cards
48

IN operator

  • It allows you to specify a list of character strings to be included in a search

New cards
49

JOIN clause

  • It is used to combine rows from more than one table, based on a common field between them. Sometimes it is done by using the '=' symbol.

New cards
50

LIKE operator

  • It allows you to specify partial character strings in a "wildcard" sense.

New cards
51

OR operator

  • It displays a record it either the first condition OR the second condition is true.

New cards
52

ORDER BY clause

  • It simply takes the result of a SQL query and orders them by one or more specified attributes.

New cards
53

SELECT command

  • Data retrieval in SQL is accomplished with the SELECT command.

New cards
54

Subquery

  • When on SELECT statement is "nested" within another in a format, it is known as subquery. This is shown when there is a second SELECT phrase within a set of parenthesis.

New cards
55

Common DDL commands:

  • DROP

  • ALTER

  • RENAME

  • CREATE

  • TRUNCATE

New cards
56

Common DML commands:

  • UPDATE

  • DELETE

  • INSERT

  • MERGE

  • SELECT

New cards
57

Write the basic SQL query command:

SELECT
New cards
58

Write the SQL query to "Find the commission percentage and year of hire of salesperson 186":

SELECT COMMPERCT, YEARHIRE FROM SALESPERSON WHERE SPNUM=186;

New cards
59

Write the SQL query to "Retrieve the entire record for salesperson 186":

SELECT * FROM SALESPERSON WHERE SPNUM=186;

New cards
60

Write the SQL query to "List the salesperson numbers and salesperson names of those salespersons who have a commission percentage of 10.":

SELECT SPNUM, SPNAME FROM SALESPERSON WHERE COMMPERCT=10;

New cards
61

Write the SQL query to "List the salesperson numbers, salesperson names, and commission percentages of the salespersons whose commission percentage is less than 12.":

SELECT SPNUM, SPNAME, COMMPERCT FROM SALESPERSON WHERE COMMPERCT<12;

New cards
62

Write the SQL query to "List the customer numbers and headquarters cities of all customers that have a customer number of at least 1700":

SELECT CUSTNUM, HQCITY FROM CUSTOMER WHERE CUSTNUM>=1700;

New cards
63

Write the SQL query to "List the customer numbers, customer names, and headquarters cities of the customers that are headquartered in New York and that have a customer number higher than 1500":

SELECT CUSTNUM, CUSTNAME, HQCITY FROM CUSTOMER WHERE HQCITY='New York' AND CUSTNUM>1500;

New cards
64

Write the SQL query to "List the customer numbers, customer names, and headquarters cities of the customers that are headquartered in New York OR that have customer numbers higher than 1500":

SELECT CUSTNUM, CUSTNAME, HQCITY FROM CUSTOMER WHERE HQCITY='New York' OR CUSTNUM>1500;

New cards
65

Write the SQL query to "List the customers, customer names, and headquarters cities of the customers that are headquartered in New York or that satisfy the two conditions of having a customer number higher than 1500 and being headquartered in Atlanta":

SELECT CUSTNUM, CUSTNAME, HQCITY FROM CUSTOMER WHERE HQCITY='New York' OR (CUSTNUM>1500 AND HQCITY='Atlanta');

New cards
66

Write the SQL query to "List the customer records for those customers whose names begin with the letter 'A' ":

SELECT * FROM CUSTOMER WHERE CUSTNAME LIKE 'A%';

New cards
67

Write the SQL query to "Find the customer numbers, customer names, and headquarters cities of those customers with the customer numbers greater than 1000. List the results in alphabetic order by headquarters cities (and have the customer names within the same city alphabetized)":

SELECT CUSTNUM, CUSTNAME, HQCITY FROM CUSTOMER WHERE CUSTNUM>1000 ORDER BY HQCITY, CUSTNAME;

New cards
68

Write the SQL query to "Find the average quantity of units of the different products that Salesperson 137 has sold":

SELECT AVG(QUANTITY) FROM SALES WHERE SPNUM=137;

New cards
69

Write the SQL query to "Find the total quantity of units of all products that Salesperson 137 has sold":

SELECT SUM(QUANTITY) FROM SALES WHERE SPNUM=137;

New cards
70

Write the SQL query to "Find the name of the salesperson responsible for Customer Number 1525":

SELECT SPNAME FROM SALESPERSON, CUSTOMER WHERE SALESPERSON.SPNUM=CUSTOMER.SPNUM AND CUSTNUM=1525;

New cards
71

Write the SQL query to "List the NAMES of the products of which salesperson Adams has sold more than 2000 units":

SELECT PRODNAME FROM SALESPERSON, PRODUCT, SALES WHERE SALESPERSON.SPNUM=SALES.SPNUM AND SALES.PRODNUM=PRODUCT.PRODNUM AND SPNAME='Adams' AND QUANTITY>2000;

New cards
72

CREATE TABLE command

The command that creates base tables and tells the system what attributes will be in them.

New cards
73

CREATE VIEW command

Specifies the base tables on which the view is to be based and the attributes and rows of the table that are to be included in the view.

New cards
74

DELETE command

Specify which row(s) of a table are to be deleted based on data values within those rows.

New cards
75

DROP TABLE command

Discards an entire table from a database.

New cards
76

DROP VIEW command

Discards views.

New cards
77

Normalization

The process of organizing the fields and tables of a relational database to minimize redundancy (duplication) and dependency.

New cards
78

Second Normal Form

All non-key attributes must be functionally dependent on the entire key of that table.

New cards
79

Third Normal Form

Non-key attributes are not allowed to define other non-key attributes.

New cards
80

What are three important points about Third Normal Form?

  1. It is completely free of redundancy

  2. All foreign keys appear where needed to logically tie together related tables.

  3. It is the same structure that would have been derived from a properly drawn entity-relationship diagram of the same business environment.

New cards
81

Write the SQL query to "Add a new salesperson into the SALESPERSON table whose salesperson number is 489, name is Quinlan, commission percentage is 15, year of hire is 2011, and department number is 59.":

INSERT INTO SALESPERSON VALUES ('489','Quinlan',15,'2011','59');

*Hint, this is DML, so remember that INSERT is one of the keywords for DML.

New cards
82

Write the SQL query to "Delete the row for salesperson 186 from the SALESPERSON table.":

DELETE FROM SALESPERSON WHERE SPNUM = '186';

New cards
83

What is the correct syntax of the INSERT command?

INSERT INTO table_name VALUES (value1,value2,value3,...):

New cards
84

What is the correct syntax of the CREATE VIEW command?

CREATE VIEW view_name AS SELECT column_name(s) FROM table_name WHERE condition

New cards
85

What is called a decomposition process?

Data normalization

New cards
86

In which of the normal forms should every non-key attribute be fully functionally dependent on the entire key of a table?

Second form

New cards
87

What is the correct syntax of the CREATE TABLE command?

CREATE TABLE table_name ( column_name data_type(size), );

New cards
88

What is the correct syntax of the UPDATE command?

UPDATE table_name SET column1=value1,column2=value2,... WHERE some_column=some_value;

New cards
89

Association Rules

Association rules specify a relation between attributes that appears more frequently than expected if the attributes were independent.

New cards
90

Business Intelligence

The processes, technologies, and tools needed to turn data into information, information into knowledge, and knowledge into plans that drive profitable business action.

New cards
91

Classification

Classification involves examining the attributes of a particular object and assigning it to a defined class.

New cards
92

Clustering

Clustering is the task of taking a large collection of objects and dividing them into smaller groups of objects that exhibit some similarity.

New cards
93

Affinity Grouping

Affinity Grouping is a process of evaluating relationships or associations between data elements that demonstrate some kind of affinity between objects.

New cards
94

What are the values of Business Intelligence?

  • Financial value associated w/ increased profitability.

  • Productivity value associated with increased throughput.

  • Trust value (customer, employee, supplier satisfaction) as well as increased confidence in forecasting.

  • Risk value - decreased risk associated with decision making

New cards
95

What are the reasons for using the Dimensional Model for Business Intelligence?

  • Simplicity.

  • Lack of bias.

  • Extensibility.

New cards
96

What are the fundamental aspects of a Data Warehouse?

  • Centralized repository of information.

  • Organized around relevant subject areas.

  • Provides platform for queries.

  • Used for analysis and not transactional processing.

  • Data is nonvolatile.

  • Target location for integrating data from multiple sources.

New cards
97

What is the general theme of the ETL process?

  1. Get the data

  2. Map the data to staging area

  3. Validate and clean the data

  4. Apply necessary transformations

  5. Map data to loading model

  6. Move data to repository

  7. Load data to warehouse

New cards
98

What is the key factor based on the need for linear scalability?

Performance

New cards
99

What is used for populating summaries or any cube dimensions that can be performed at the staging area (ETL)?

Aggregation

New cards
100

What data mining activity is a process of assigning some continuously valued numeric value to an object?

Estimation

New cards

Explore top notes

note Note
studied byStudied by 6 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 4 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 12 people
Updated ... ago
5.0 Stars(2)
note Note
studied byStudied by 49 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 19 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 4 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 131405 people
Updated ... ago
4.8 Stars(623)

Explore top flashcards

flashcards Flashcard40 terms
studied byStudied by 2 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard128 terms
studied byStudied by 11 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard43 terms
studied byStudied by 19 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard87 terms
studied byStudied by 5 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard35 terms
studied byStudied by 51 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard37 terms
studied byStudied by 16 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard47 terms
studied byStudied by 1 person
Updated ... ago
5.0 Stars(1)
flashcards Flashcard86 terms
studied byStudied by 64 people
Updated ... ago
5.0 Stars(2)