Course description
IBM Data Engineering Professional Certificate
This Professional Certificate is for anyone who wants to develop job-ready skills, tools, and a portfolio for an entry-level data engineer position. Throughout the self-paced online courses, you will immerse yourself in the role of a data engineer and acquire the essential skills you need to work with a range of tools and databases to design, deploy, and manage structured and unstructured data.
By the end of this Professional Certificate, you will be able to explain and perform the key tasks required in a data engineering role. You will use the Python programming language and Linux/UNIX shell scripts to extract, transform and load (ETL) data. You will work with Relational Databases (RDBMS) and query data using SQL statements. You will use NoSQL databases and unstructured data. You will be introduced to Big Data and work with Big Data engines like Hadoop and Spark. You will gain experience with creating Data Warehouses and utilize Business Intelligence tools to analyze and extract insights.
This program does not require any prior data engineering, or programming experience.
Do you work at this organisation and want to update this page?
Is there out-of-date information about your organisation or courses published here? Fill out this form to get in touch with us.
Upcoming start dates
Suitability - Who should attend?
No prior experience required.
Outcome / Qualification etc.
What you will learn
- Create, design, and manage relational databases and apply database administration (DBA) concepts to RDBMSes such as MySQL, PostgreSQL, and IBM Db2.
- Develop and execute SQL queries using SELECT, INSERT, UPDATE, DELETE statements, database functions, stored procedures, Nested Queries, and JOINs.
- Demonstrate working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, Spark Streaming.
- Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards.
Skills you will gain
- Relational Database Management Syste (RDBMS)
- ETL & Data Pipelines
- NoSQL and Big Data
- Apache Spark
- SQL
- Data Science
- Database (DBMS)
- NoSQL
- Python Programming
- Data Analysis
- Pandas
- Numpy
Training Course Content
- Introduction to Data Engineering
- Python for Data Science, AI & Development
- Python Project for Data Engineering
- Introduction to Relational Databases (RDBMS)
- Databases and SQL for Data Science with Python
- Hands-on Introduction to Linux Commands and Shell Scripting
- Relational Database Administration (DBA)
- ETL and Data Pipelines with Shell, Airflow and Kafka
- Getting Started with Data Warehousing and BI Analytics
- Introduction to NoSQL Databases
- Introduction to Big Data with Spark and Hadoop
- Data Engineering and Machine Learning using Spark
- Data Engineering Capstone Project
Course delivery details
This course is offered through IBM Skills Network, a partner institute of Coursera.
Under 10 hours of study a week
Expenses
Please visit the Institute website for more information about tuition fees