#Data Engineering
20 articles with this tag

Snowflake Taps AI for Retail Scale
Snowflake Intelligence is empowering retailers like the Mark Anthony Group to scale AI, democratize data access, and drive business outcomes through generative BI.

Snowflake's AI Coder Goes Broad
Snowflake's Cortex Code AI agent is now widely available, featuring enhanced usability and new capabilities for data engineering.
Databricks A/B Testing Framework Powers Game Analytics
HARDlight leverages Databricks to automate A/B testing analysis, doubling experimentation capacity and building trust through standardized, LLM-enhanced insights.

Snowflake's AI Coding Agent Streamlines Data Engineering
Snowflake's new AI coding agent, Cortex Code, aims to simplify data pipeline creation and accelerate development for engineers and analysts.
Snowflake Boosts AI Data Sharing
Snowflake enhances its AI data sharing platform with new features for reliability, usability, and transparency, crucial for production AI.
Databricks AutoCDC Ends Hand-Coding Pain
Databricks AutoCDC automates change data capture and SCD pipelines, slashing manual coding, improving performance, and cutting costs with declarative simplicity.
Spark Streaming Hits Millisecond Latency
Databricks' Apache Spark Structured Streaming real-time mode is now GA, offering sub-second latency and consolidating streaming needs onto a single engine.
Databricks Lakeflow Jobs vs Airflow
Databricks Lakeflow Jobs offers native lakehouse orchestration, mapping Airflow patterns like XComs, sensors, and branching to a more integrated, data-centric model.
Databricks Adds Free Data Ingestion Tier
Databricks launches a free tier for its Lakeflow Connect data ingestion tool and enhances its AI capabilities with Lakebase and Genie updates.
Data Science Careers: Skills & Paths
Explore the essential skills, diverse career paths, and educational routes shaping the data science landscape in 2024.
Databricks Serverless JARs Launch
Databricks Serverless JARs enable instant deployment of Scala/Java Spark jobs, eliminating cluster management and offering usage-based billing.
Spark Drops Microbatch for Real-Time
Apache Spark's Real-Time Mode (RTM) breaks microbatch barriers, enabling millisecond latency for streaming workloads with a new hybrid execution model.
Databricks Serverless Simplifies Data Ops
Databricks serverless compute automates infrastructure management, boosting performance and cutting costs for data engineering workflows.
Databricks' Genie Code: AI for Data Work
Databricks launches Genie Code, an AI agent designed to automate and optimize complex data workflows, promising to double success rates over traditional coding agents.
Databricks Unleashes Genie Code AI
Databricks launches Genie Code, an AI agent designed to automate data tasks and significantly improve success rates in data science.
Databricks Unlocks Billion-Scale Vector Search
Databricks unveils a redesigned vector search capable of handling billions of vectors, drastically cutting costs and improving scalability.
Databricks Streamlines Real-Time Data Apps
Databricks' Zerobus Ingest and Lakebase combine for streamlined IoT data ingestion and low-latency operational applications directly on the Lakehouse.
Databricks Lakehouse Gets Postgres Boost on Azure
Databricks launches Azure Databricks Lakebase, a serverless PostgreSQL service integrating operational data into the lakehouse for unified app development and analytics.
Databricks Reffy: From Tribal Data to AI Answers
Databricks' Reffy uses AI and RAG to turn scattered customer stories into an instantly searchable knowledge base for sales and marketing.
Spark Ditches Dual Engines for Real-Time Mode
Databricks' new Real-Time Mode for Spark aims to deliver sub-second streaming speeds, eliminating the need for separate processing engines.