Data Science | Universal Up Skills

Monitoring a stored procedure

by M Ruchi | Dec 7, 2025 | Data Science | 0 |

Monitoring a stored procedure involves tracking its execution to analyze performance, debug issues, and ensure correct functionality. Here are ways to monitor stored procedures in SQL Server or other relational database systems....

Difference CSV and Parquet

by M Ruchi | Dec 7, 2025 | Data Science | 0 |

CSV and Parquet are both file formats for storing data, but they differ in several ways: Storage structure CSV files are row-oriented, while Parquet files are column-oriented. In a CSV file, each line is a record, and each...

Unity Catalog in Azure Databricks

by M Ruchi | Dec 7, 2025 | Data Science | 0 |

Unity Catalog, a unified governance solution for data and AI assets on Azure Databricks. Overview of Unity Catalog Unity Catalog provides centralized access control, auditing, lineage, and data discovery capabilities across...

Z-Ordering in Apache Spark

by M Ruchi | Nov 10, 2025 | Data Science | 0 |

Learn how Z-Ordering in Apache Spark and Delta Lake optimizes query performance by clustering data based on specific columns. Discover how it reduces data scans, improves query speed, and enhances big data analytics efficiency.

Schema Enforcement in Delta Lake

by M Ruchi | Nov 4, 2025 | Data Science | 0 |

Schema Enforcement in Delta Lake Schema enforcement (also known as schema validation) is a key feature of Delta Lake, which ensures that the data being written to a Delta table matches a defined schema. This helps to maintain...

Understanding Delta Lake, Delta Table, and Delta Live Table in Databricks

by M Ruchi | Nov 4, 2025 | Data Science | 0 |

Understanding Delta Lake, Delta Table, and Delta Live Table in Databricks Databricks offers a suite of tools for managing and analyzing data efficiently, with Delta Lake, Delta Table, and Delta Live Table being core components....

Load Data from On-Premises to ADLS

by M Ruchi | Nov 4, 2025 | Data Science | 0 |

Loading data from an on-premises data source to Azure Data Lake Storage (ADLS) using Azure Data Factory (ADF) involves the following steps. The key component for connecting on-premises data sources is the Self-Hosted Integration...

Email Notifications in ADF

by M Ruchi | Jul 19, 2025 | Data Science | 0 |

In Azure Data Factory (ADF), you can set up email notifications to alert users when certain events occur, such as the success or failure of a pipeline or trigger execution. Here’s how you can implement it: 1. Using Logic Apps...

Triggers in ADF

by M Ruchi | Jul 16, 2025 | Data Science | 0 |

In Azure Data Factory (ADF), triggers are used to schedule and automate pipeline execution. They allow pipelines to run on specific schedules, in response to events, or when manually invoked. Triggers help orchestrate and manage...

load data from on-premise to Databricks

by M Ruchi | Jun 26, 2025 | Data Science | 0 |

To transfer data from an on-premise system to Databricks, you need to establish secure connectivity between your on-premise infrastructure and the Databricks environment. After setting up the connection, you can use Databricks...

Partitioning vs Bucketing in Hive and Spark

by M Ruchi | Jun 26, 2025 | Data Science | 0 |

In big data ecosystems like Hive and Apache Spark, partitioning and bucketing are powerful techniques used to organize data for better performance during query execution. While both aim to optimize data access, they work...

Data Science and Technologies used

by M Ruchi | Jun 26, 2025 | Data Science | 0 |

Here’s a table detailing key components of Data Science and the technologies commonly used: Data Science Component Description Technologies Used Data Collection Gathering raw data from various sources. SQL, MongoDB, APIs, Web...

Category: Data Science

Monitoring a stored procedure

Difference CSV and Parquet

Unity Catalog in Azure Databricks

Z-Ordering in Apache Spark

Schema Enforcement in Delta Lake

Understanding Delta Lake, Delta Table, and Delta Live Table in Databricks

Email Notifications in ADF

Triggers in ADF

load data from on-premise to Databricks

Partitioning vs Bucketing in Hive and Spark

Data Science and Technologies used

Categories