Cluster Architecture

26 Terms

1

System Administrator

Responsible for the installation and maintenance of software.

2

Database Administrator

Overseeing the management of databases, ensuring their efficiency and security.

3

What are the three main courses of data science, and what does each cover?

Data science is not a standalone course but a comprehensive major integrating various disciplines. It encompasses:

- Data Mining: Uncovering actionable insights from data.

- Big Data: Handling and analyzing massive datasets.

- Machine Learning: Utilizing algorithms to enable systems to learn and make predictions.

4

Linear Algebra

Understanding and manipulating matrices and vectors, fundamental to handling high-dimensional data.
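As a small illustration of manipulating vectors and matrices, the sketch below implements a dot product and a matrix-vector product in plain Python. The names `dot`, `mat_vec`, `projection`, and `point` are hypothetical and chosen only for this example; real workloads would normally use a numerical library instead.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def dot(u: Vector, v: Vector) -> float:
    """Return the dot product of two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return sum(a * b for a, b in zip(u, v))


def mat_vec(m: Matrix, v: Vector) -> Vector:
    """Multiply matrix m (a list of rows) by vector v."""
    return [dot(row, v) for row in m]


# Hypothetical example: drop the third coordinate of a 3-dimensional point,
# a very simple form of dimensionality reduction.
projection = [[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0]]
point = [2.0, 3.0, 5.0]
reduced = mat_vec(projection, point)
```

Here `reduced` keeps only the first two coordinates of `point`, which shows how a matrix can map high-dimensional data into fewer dimensions.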

5

Optimization

Techniques for optimizing models and algorithms to improve efficiency and performance in handling large datasets.

6

Dynamic Programming

A method for solving complex problems by breaking them down into simpler, overlapping subproblems, often used in optimization.
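One way to see overlapping subproblems in an optimization setting is the classic 0/1 knapsack problem. The sketch below is an illustrative example, not something taken from the course material; the function name `knapsack` and the sample weights and values are assumptions.

```python
from typing import Sequence


def knapsack(weights: Sequence[int], values: Sequence[int], capacity: int) -> int:
    """Return the best total value of items that fit within capacity.

    best[c] stores the best value achievable with capacity c using the
    items processed so far; each entry is reused by larger capacities,
    which is the overlapping-subproblem structure.
    """
    if len(weights) != len(values):
        raise ValueError("weights and values must have the same length")
    best = [0] * (capacity + 1)
    for w, v in zip(weights, values):
        # Iterate capacities downward so each item is used at most once.
        for c in range(capacity, w - 1, -1):
            best[c] = max(best[c], best[c - w] + v)
    return best[capacity]


# Hypothetical data: four items and a knapsack of capacity 7.
result = knapsack([1, 3, 4, 5], [1, 4, 5, 7], 7)
```

With this data the best choice is the items of weight 3 and 4, so `result` is 9.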

7

Hashing (LSH and Bloom Filter)

Utilizing hashing techniques, such as Locality-Sensitive Hashing (LSH) and Bloom filters, for efficient data storage and retrieval.
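A Bloom filter answers "have I possibly seen this item?" using a fixed-size bit array and several hash functions. The sketch below is a minimal version under stated assumptions: the class name `BloomFilter`, the default sizes, and the use of seeded SHA-256 digests as the hash family are all choices made for this example.

```python
import hashlib
from typing import Iterator


class BloomFilter:
    """Minimal Bloom filter: no false negatives, possible false positives."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit keeps the code simple; a real filter packs bits.
        self.bits = bytearray(num_bits)

    def _positions(self, item: str) -> Iterator[int]:
        # Derive several hash values by prefixing the item with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item: str) -> bool:
        # True means "possibly present"; False means "definitely absent".
        return all(self.bits[pos] for pos in self._positions(item))


seen = BloomFilter()
seen.add("spark")
```

After this, `seen.might_contain("spark")` is guaranteed to be true, while a lookup for an item that was never added is usually false but can occasionally be a false positive.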

8

Streams and Concurrency

Concepts related to handling data streams and concurrent processing.
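Both ideas can be sketched briefly. The generator below processes a stream one element at a time with constant memory, and the second function spreads independent work across a thread pool. The function names and the choice of four workers are assumptions made for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Iterable, Iterator, List


def running_mean(stream: Iterable[float]) -> Iterator[float]:
    """Yield the mean of all values seen so far, one value at a time."""
    count = 0
    total = 0.0
    for x in stream:
        count += 1
        total += x
        yield total / count


def count_words(chunk: str) -> int:
    """Count whitespace-separated words in one chunk of text."""
    return len(chunk.split())


def total_words(chunks: List[str]) -> int:
    """Count words across chunks, processing the chunks concurrently."""
    with ThreadPoolExecutor(max_workers=4) as pool:
        return sum(pool.map(count_words, chunks))
```

For example, `list(running_mean([2.0, 4.0, 6.0]))` gives `[2.0, 3.0, 4.0]`, and `total_words(["a b", "c d e"])` gives 5.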

9

Small Data

Data that is small enough for humans to interpret directly and that accumulates slowly.

10

Big Data

Data generated in huge volumes that can be structured, semi-structured, or unstructured.

11

What are the five stages of the data life cycle?

Data Collection

Data Collection

Data Modeling

Data Processing

Data Visualization

12

Data Collection

The process of collecting data in response to a business problem.

13

Data Modeling

Creating a data model to make sense of the collected data and establish relationships.

14

Data Processing

Using tools like Apache Spark to process and analyze the modeled data.

15

Data Visualization

Presenting data in a graphical format to derive meaningful insights.

16

Internet of Things (IoT)

An interconnected network of smart devices that collect, analyze, and act upon data.

17

Single Node Architecture

Execution of algorithms on a single CPU, with direct data access from memory.

18

Data Scaling

The ability of a system to handle an increasing amount of data, including storage scaling and computational scaling.

19

Identify the key tooling categories within the big data ecosystem.

There are six tooling categories in the big data ecosystem:

  • Analytics and Visualization

  • Business Intelligence

  • Cloud providers

  • NoSQL

  • Programming tools

  • Data technology

20

Types of Data

Different types of data, such as high-dimensional data, graph data, infinite data, and labeled data.

21

Descriptive Method

Finding human-interpretable patterns that describe the data.

22

Predictive Method

Using variables to predict unknown or future values of other variables.
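A simple instance of a predictive method is fitting a straight line to known pairs and using it to estimate an unseen value. The sketch below uses ordinary least squares with one input variable; the function names `fit_line` and `predict` and the sample numbers are assumptions made for this example.

```python
from typing import Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model: Tuple[float, float], x: float) -> float:
    """Predict y for a new x using a fitted (slope, intercept) pair."""
    slope, intercept = model
    return slope * x + intercept


# Hypothetical data that lies exactly on y = 2x + 1.
model = fit_line([1.0, 2.0, 3.0, 4.0], [3.0, 5.0, 7.0, 9.0])
forecast = predict(model, 5.0)
```

Because the sample points lie exactly on a line, the fitted slope is 2, the intercept is 1, and `forecast` is 11.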

23

Large-scale Computing

Dealing with machine failures and redundancy in storage infrastructure.
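The role of redundancy can be illustrated with a toy simulation: each chunk of data is stored on several nodes, so it stays readable as long as one holder survives. This is a sketch only. The round-robin placement, the default replication factor of 3, and all names below are assumptions, not a description of any particular storage system.

```python
from typing import Dict, List, Sequence, Set


def place_replicas(chunk_ids: Sequence[str], nodes: Sequence[str],
                   replication: int = 3) -> Dict[str, List[str]]:
    """Assign each chunk to `replication` distinct nodes, round-robin."""
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    placement = {}
    for i, chunk in enumerate(chunk_ids):
        placement[chunk] = [nodes[(i + r) % len(nodes)]
                            for r in range(replication)]
    return placement


def readable_chunks(placement: Dict[str, List[str]],
                    failed: Set[str]) -> Set[str]:
    """Return the chunks that still have at least one live replica."""
    return {chunk for chunk, holders in placement.items()
            if any(node not in failed for node in holders)}


# Hypothetical cluster: four nodes, three chunks, two replicas per chunk.
placement = place_replicas(["c0", "c1", "c2"], ["n0", "n1", "n2", "n3"], 2)
after_one_failure = readable_chunks(placement, {"n1"})
after_two_failures = readable_chunks(placement, {"n0", "n1"})
```

With two replicas per chunk, losing node `n1` leaves every chunk readable, while losing both `n0` and `n1` makes `c0` unavailable because both of its replicas lived on those nodes.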

24

Distributed File System

A file system that enables clients to access file storage from multiple hosts through a computer network.

25

What is data mining?

Data mining covers the various tasks that contribute to extracting meaningful insights from vast datasets. In this course, the focus is on installing existing software and conducting analyses, without delving deeply into database intricacies.

Data mining involves the extraction of substantial, actionable data.

26

What is actionable data?

Conclusive Meaning: The data leads to meaningful conclusions.

Relevance: The insights extracted are pertinent to the objectives at hand.

Actionable data is information that goes beyond raw figures and leads to conclusions that hold significance. For example, an insight is actionable when it not only provides statistical information but also contributes to informed decision-making.
