UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 13 | Issue 3 | March 2026

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 7 Issue 2
February-2020
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2002539


Registration ID:
546379

Page Number

925-936

Share This Article


Jetir RMS

Title

Optimizing Data Engineering Through a Comprehensive Exploration of Modern Pipeline Tools

Abstract

There are currently many pipeline tools available for data engineering, which data scientists use to tackle data wrangling issues and perform tasks ranging from data ingestion and preparation to its use as input for machine learning (ML). These tools either come with essential built-in components or can be integrated with others to achieve the desired data engineering outcomes. While some of these tools are fully or partially commercial, there is also a range of open-source options capable of handling expert-level data engineering tasks. This survey explores various categories and examples of these pipeline tools based on their design and intended data engineering functions. The categories covered include Extract Transform Load/Extract Load Transform (ETL/ELT), Data Integration, Ingestion, and Transformation pipelines, Data Pipeline Orchestration and Workflow Management, and Machine Learning Pipelines. Additionally, the survey outlines how these tools are used within these categories, providing examples and discussing case studies that showcase real-world applications. These case studies illustrate user experiences with sample data, the complexities involved in the pipelines, and summarize approaches for preparing data for machine learning.

Key Words

Artificial Intelligence (AI), Data Engineering, Structured Data, Unstructured Data ETL, Machine Learning, Data Integration, Data Pipeline etc.

Cite This Article

"Optimizing Data Engineering Through a Comprehensive Exploration of Modern Pipeline Tools ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.7, Issue 2, page no.925-936, February-2020, Available :http://www.jetir.org/papers/JETIR2002539.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Optimizing Data Engineering Through a Comprehensive Exploration of Modern Pipeline Tools ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.7, Issue 2, page no. pp925-936, February-2020, Available at : http://www.jetir.org/papers/JETIR2002539.pdf

Publication Details

Published Paper ID: JETIR2002539
Registration ID: 546379
Published In: Volume 7 | Issue 2 | Year February-2020
DOI (Digital Object Identifier): https://doi.org/10.5281/zenodo.14190243
Page No: 925-936
Country: -, -, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

000237

Print This Page

Current Call For Paper

Jetir RMS