Get in Touch

Course Outline

  1. Fundamentals of Big Data
    • The role of Big Data in the corporate landscape
    • Phases involved in developing a corporate Big Data strategy
    • Understanding the rationale behind a holistic Big Data approach
    • Essential components of a Big Data platform
    • Big Data storage solutions
    • Limitations of traditional technologies
    • Overview of database types
    • The four dimensions of Big Data
  2. Business Impact of Big Data
    • Strategic importance of Big Data for business
    • Challenges associated with extracting actionable insights
    • Integrating Big Data with traditional data systems
  3. Big Data Storage Technologies
    • Overview of Big Data technologies
      • Data storage models
      • Hadoop
      • Hive
      • Cassandra
      • MongoDB
    • Selecting the appropriate Big Data technology
  4. Processing Big Data
    • Connecting to and extracting data from databases
    • Transforming and preparing data for processing
    • Utilizing Hadoop MapReduce for distributed data processing
    • Monitoring and executing Hadoop MapReduce jobs
    • Core building blocks of the Hadoop Distributed File System
    • MapReduce and Yarn
    • Handling streaming data with Spark
  5. Big Data Analysis Tools and Technologies
    • Programming Hadoop using Pig Latin
    • Querying Big Data with Hive
    • Data mining with Mahout
    • Visualization and reporting tools
  6. Big Data in Business Context
    • Managing and defining Big Data requirements
    • Strategic importance of Big Data for business
    • Selecting the right Big Data tools for specific problems

Data Warehousing Concepts

  • Defining a Data Warehouse
  • Differences between OLTP and Data Warehousing
  • Data Acquisition
  • Data Extraction
  • Data Transformation
  • Data Loading
  • Data Marts
  • Dependent vs. Independent Data Marts
  • Database Design

ETL Testing Concepts:

  • Introduction
  • Software Development Life Cycle
  • Testing methodologies
  • ETL Testing Workflow Process
  • ETL Testing Responsibilities in Data Stage.

Big Data Fundamentals

  • The role of Big Data in the corporate landscape
  • Phases involved in developing a corporate Big Data strategy
  • Understanding the rationale behind a holistic Big Data approach
  • Essential components of a Big Data platform
  • Big Data storage solutions
  • Limitations of traditional technologies
  • Overview of database types

NoSQL Databases

Hadoop

Map Reduce

Apache Spark

Requirements

Participants should possess a foundational awareness and some practical experience with storage tools, as well as an understanding of how to manage large datasets.

 14 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories