ETL (Extract-Transform-Load)

ETL comes from data warehousing and stands for Extract-Transform-Load. It describes how data is loaded from the source system into the data warehouse. Today, ETL often includes cleaning as a separate step, so the sequence becomes Extract-Clean-Transform-Load. Let us briefly ……
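
As a rough illustration of the Extract-Clean-Transform-Load sequence, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the sample CSV, the `sales` table, and the function names `extract`, `clean`, `transform`, `load`, and `run_etl` are hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source data standing in for a source-system export.
SOURCE_CSV = """id,name,amount
1, Alice ,10.50
2,Bob,
3,Carol,7.25
"""


def extract(text):
    """Extract: read raw rows from the source as dictionaries of strings."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Clean: trim whitespace and drop rows with a missing amount."""
    cleaned = []
    for row in rows:
        row = {key: value.strip() for key, value in row.items()}
        if row["amount"]:
            cleaned.append(row)
    return cleaned


def transform(rows):
    """Transform: cast types and convert amounts to integer cents."""
    return [
        (int(r["id"]), r["name"], round(float(r["amount"]) * 100))
        for r in rows
    ]


def load(records, conn):
    """Load: write the transformed records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(id INTEGER, name TEXT, amount_cents INTEGER)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl(conn):
    """Run the four steps in order; data is reshaped before it is loaded."""
    load(transform(clean(extract(SOURCE_CSV))), conn)
```

Calling `run_etl(sqlite3.connect(":memory:"))` leaves only the cleaned, typed rows in `sales`; the row with the missing amount never reaches the target.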

ELT vs. ETL

Many organizations are increasingly turning to ELT (Extract, Load, and Transform) tools to address the volume, variety, and velocity of big data sources, which often strain conventional Extract, Transform, and Load (ETL) tools designed for internal, relational data warehousing. ELT vs ETL: What’s t……
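
To make the difference in ordering concrete, the following sketch shows the ELT pattern: raw data is loaded first, and the transformation runs afterwards inside the target system. It is only an illustration under stated assumptions: the `raw_sales` and `sales` tables, the sample rows, and the function `run_elt` are hypothetical, and an in-memory SQLite database stands in for the target platform.

```python
import sqlite3

# Hypothetical raw records, loaded exactly as they arrive (all strings).
RAW_ROWS = [
    ("1", " Alice ", "10.50"),
    ("2", "Bob", ""),
    ("3", "Carol", "7.25"),
]


def run_elt(conn):
    """Load raw rows into a staging table, then transform them with SQL."""
    # Load: store the untouched source data in the target system.
    conn.execute("CREATE TABLE raw_sales (id TEXT, name TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?, ?)", RAW_ROWS)

    # Transform: the target's own SQL engine cleans and reshapes the data.
    conn.execute(
        """
        CREATE TABLE sales AS
        SELECT CAST(id AS INTEGER) AS id,
               TRIM(name) AS name,
               CAST(ROUND(CAST(amount AS REAL) * 100) AS INTEGER) AS amount_cents
        FROM raw_sales
        WHERE amount <> ''
        """
    )
    conn.commit()
```

Unlike the ETL ordering, the raw rows stay available in `raw_sales` after the run, so the transformation can be revised and rerun without going back to the source.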

Hadoop Best Practices for Data Ingestion

Hadoop data ingestion is the beginning of your data pipeline in a data lake. It means taking data from various siloed databases and files and putting it into Hadoop. Sounds arduous? For many companies, it does turn out to be an intricate task. That is why they take more than a year to ingest all the……

What is Data Lake? Its Architecture

What is Data Lake? A Data Lake is a storage repository that can store a large amount of structured, semi-structured, and unstructured data. It is a place to store every type of data in its native format with no fixed limits on account size or file. It offers high data quantity to increase analytic p……