Data Engineering
2023-06-30
Data Types in Snowflake
This article provides an overview of the various data types supported by Snowflake, including numeric, string and binary, logical, date and time, semi-structured, and geospatial data types, as well as the mappings between SQL data types and handler languages used in stored procedures and UDFs.
Data Engineering
Snowflake
2023-06-30
How to Implement Separate Development and Production Environments in Snowflake
This article explains the strategies of creating separate development, testing, and production environments in Snowflake.
Data Engineering
Snowflake
2023-05-29
Computing Geographic Distance with Snowflake UDF
This article explains the process of implementing a User-Defined Function (UDF) in Snowflake that computes geographic distances using the Haversine Formula.
Data Engineering
Snowflake
2023-01-23
Stored Procedures vs. UDFs in Snowflake
This article explains the distinctions between stored procedures and UDFs, including their purposes, usage behaviors, and database access capabilities.
Data Engineering
Snowflake
2022-12-30
The Differences Between Data Lakes, Data Warehouses, and Data Marts
This article provides a detailed comparison of data lakes, data warehouses, and data marts - the three primary solutions that have emerged to address the needs of modern organizations to store, process, and analyze massive amounts of data. The article highlights the unique features, use cases, and benefits of each solution. It compares the differences between data lakes, data warehouses, and data marts in terms of data types, data processing, data sources, data structure, and user access.
Data Engineering
2022-12-30
Typical data modeling approaches
This article explains about typical data modeling approaches.
Data Engineering
Data Modeling
2022-12-30
What is Dimensional Modeling
This article explores dimensional modeling, a data modeling technique used in data warehousing to optimize querying and analysis. It explains the importance of dimensional modeling in simplifying data models and enabling faster query performance, and discusses key components such as dimensions, facts, relationships, star and snowflake schemas, and conformed dimensions and facts. It also covers the critical considerations when designing a dimensional model, such as choosing between star and snowflake schemas, designing for performance, and creating data marts with dimension and fact tables.
Data Engineering
Data Modeling
2022-12-30
What is Star Schema
This article explains what a star schema is, how to design one, and how to query it using SQL. A star schema is a type of database modeling technique used in data warehousing that organizes data into a centralized table called a fact table that is surrounded by multiple dimension tables. It simplifies the structure of the data, making it easy for analysts and business users to understand and access it.
Data Engineering
Data Modeling
2022-12-30
DAG in dbt
This article explains about the DAG in dbt.
Data Engineering
dbt
2022-12-30
Designing Data Transformation Pipelines with dbt
This article provides a comprehensive guide to structuring a dbt (data build tool) project for effective data transformation. It covers defining sources, configuring source YAML, building staging models, and designing marts models.
Data Engineering
dbt
2022-12-30
Jinja and macros
This article explains about Jinja and macros in dbt.
Data Engineering
dbt
2022-12-30
Materializations
This article explains about the dbt Materializations.
Data Engineering
dbt
2022-12-30
dbt model
This article explains about the dbt model.
Data Engineering
dbt
2022-12-30
dbt Sources
This article explains about the dbt Sources.
Data Engineering
dbt
2022-12-30
dbt test
This article explains about dbt test.
Data Engineering
dbt
2022-12-30
What is dbt
This article explains about dbt.
Data Engineering
dbt
2022-12-11
UDF in Snowflake
This article explains User-Defined Functions (UDFs) in Snowflake, including their types, examples, supported languages, and the process of creating and calling UDFs.
Data Engineering
Snowflake
2022-12-09
Snowflake Resource Management with Terraform
This article explains how to manage Snowflake resources using Terraform.
Data Engineering
Snowflake
Terraform
2022-12-07
ELT (Extract, Load, Transform)
This article explains the concept of ELT (Extract, Load, Transform) as a data integration process, highlighting its advantages over the traditional ETL approach.
Data Engineering
2022-12-07
ETL (Extract, Transform, Load)
The article explains concepts of ETL (Extract, Transform, Load) and its critical role in consolidating data from diverse sources.
Data Engineering
2022-12-06
Machine Learning in Snowflake
This article explores the integration of machine learning (ML) within the Snowflake data platform, highlighting its unique architecture, data preparation and processing capabilities, and various approaches for building and deploying ML models. The article also discusses Snowflake's Snowpark, a developer-friendly environment for ML, and practical applications of ML in Snowflake across different industries and use cases, including customer segmentation, predictive maintenance, and fraud detection.
Data Engineering
Snowflake
Machine Learning
2022-12-06
Governance Management in Snowflake
This article explains Snowflake's governance management, ensuring data security and access control.
Data Engineering
Snowflake
2022-12-06
Stored Procedure in Snowflake
This article explains the concept of stored procedures i Snowflake.
Data Engineering
Snowflake
2022-12-05
Metadata Management
This article explains the role of metadata, data catalogs, and data lineage in maximizing data management efficiency.
Data Engineering
2022-12-05
Account Management in Snowflake
This article introduces how to manage Snowflake accounts and users.
Data Engineering
Snowflake
2022-12-05
What is Snowflake
This article explores the Snowflake data warehousing platform, its architecture, and key features such as Snowpipe, Snowpark, Zero Copy Cloning, Time Travel, and Secure Data Sharing. It also discusses the Snowflake Marketplace and compares Snowflake to competitors like BigQuery and Redshift.
Data Engineering
Snowflake
2022-11-09
How to Use Singer
This article explains how to use Singer, an open-source ETL tool.
Data Engineering
Singer
2022-11-09
Singer
This article explains the features and components of Singer, an open-source ETL tool.
Data Engineering
Singer
2022-11-05
What is Data Fabric
This article explores data fabric, a unified data infrastructure that streamlines data management and delivers insights more efficiently. It covers key components, benefits, architectural principles, and design considerations for scalable and flexible data fabric. The article also discusses data management and governance, popular data fabric platforms, and the differences between data fabric and data mesh, helping organizations choose the most suitable approach for their data challenges and objectives.
Data Engineering
2022-11-05
What is Data Warehouse
This article provides an overview of data warehouses, including their evolution, components, and popular solutions like BigQuery, Snowflake, and Amazon Redshift. The article also compares data warehouses with traditional databases, highlighting the key differences. It then delves into each of the popular solutions, discussing their features, pricing, and use cases. Lastly, the article outlines the key aspects of implementing and maintaining a data warehouse, including data ingestion, modeling, transformation, security, and governance.
Data Engineering
2022-11-05
Modern Data Stack
This article explains about the Modern Data Stack
Data Engineering
2022-10-15
About data management
This article describes about data management.
Data Engineering
2022-07-04
Data Platform Services in Google Cloud
This article presents a detailed overview of data platform services in Google Cloud, covering data warehousing with BigQuery, data lake solutions like Cloud Storage, Dataproc, and Dataplex, streaming analytics using Pub/Sub and Dataflow, business intelligence through Looker and Looker Studio, data integration with Data Fusion and Dataproc, workflow orchestration via Cloud Composer, and data security and governance using Data Catalog, Cloud DLP, and Cloud IAM.
Google Cloud
Data Engineering
2022-06-10
Migration to Meltano Version 2.0
This article introduces key points for migrating from Meltano Version 1.0 to Version 2.0.
Data Engineering
Meltano
2022-06-09
How to Use Meltano Plugins
This article introduces how to use Meltano plugins.
Data Engineering
Meltano
2022-06-08
Meltano Commands
This article introduces the main commands of Meltano.
Data Engineering
Meltano
2022-06-07
Handling Meltano Environments
This article explains how to handle Meltano environments.
Data Engineering
Meltano
2022-06-06
Transform in Meltano
This article introduces how to execute transforms in Meltano.
Data Engineering
Meltano
2022-06-05
Meltano
This article introduces Meltano, an ELT tool.
Data Engineering
Meltano
AlloyDB
Amazon Cognito
Amazon EC2
Amazon ECS
Amazon QuickSight
Amazon RDS
Amazon Redshift
Amazon S3
API
Autonomous Vehicle
AWS
AWS API Gateway
AWS Chalice
AWS Control Tower
AWS IAM
AWS Lambda
AWS VPC
BERT
BigQuery
Causal Inference
ChatGPT
Chrome Extension
CircleCI
Classification
Cloud Functions
Cloud IAM
Cloud Run
Cloud Storage
Clustering
CSS
Data Engineering
Data Modeling
Database
dbt
Decision Tree
Deep Learning
Descriptive Statistics
Differential Equation
Dimensionality Reduction
Discrete Choice Model
Docker
Economics
FastAPI
Firebase
GIS
git
GitHub
GitHub Actions
Google
Google Cloud
Google Search Console
Hugging Face
Hypothesis Testing
Inferential Statistics
Interval Estimation
JavaScript
Jinja
Kedro
Kubernetes
LightGBM
Linux
LLM
Mac
Machine Learning
Macroeconomics
Marketing
Mathematical Model
Meltano
MLflow
MLOps
MySQL
NextJS
NLP
Nodejs
NoSQL
ONNX
OpenAI
Optimization Problem
Optuna
Pandas
Pinecone
PostGIS
PostgreSQL
Probability Distribution
Product
Project
Psychology
Python
PyTorch
QGIS
R
ReactJS
Regression
Rideshare
SEO
Singer
sklearn
Slack
Snowflake
Software Development
SQL
Statistical Model
Statistics
Streamlit
Tabular
Tailwind CSS
TensorFlow
Terraform
Transportation
TypeScript
Urban Planning
Vector Database
Vertex AI
VSCode
XGBoost