Skip to content
WhyLabs

WhyLabs

NEW ACQUIRED
Category: AI Security
License: Free (Open-Source) and Commercial
Suphi Cankurt
Suphi Cankurt
AppSec Enthusiast
Updated April 3, 2026
5 min read
Key Takeaways
  • Pioneering AI observability platform acquired by Apple in January 2025 — the commercial platform has been discontinued, but the full codebase was open-sourced.
  • whylogs (2.7k+ GitHub stars) remains available as an open standard for privacy-preserving data logging, creating statistical profiles of datasets without storing raw data.
  • LangKit (960+ GitHub stars) is an open-source LLM monitoring toolkit that extracts signals from prompts and responses for safety, relevance, and quality monitoring.
  • Founded by Alessya Visnjic with backing from Bezos Expeditions, AI Fund, Defy Partners, and Madrona — raised $14M total (Seed + Series A) before the Apple acquisition.
  • The privacy-preserving approach (statistical profiles instead of raw data) influenced the AI observability category and remains relevant through the open-source projects.

WhyLabs was a privacy-preserving AI observability platform that monitored model health, detected data drift, and ensured AI system quality by creating statistical profiles of data rather than storing raw inputs. Apple acquired WhyLabs in January 2025 and discontinued the commercial platform, but the full codebase — including whylogs (2.7k+ GitHub stars) and LangKit (960+ stars) — has been released as open source. It is listed in the AI security category.

Founded by Alessya Visnjic (who joined Apple as an engineering leader after the acquisition), WhyLabs was backed by Bezos Expeditions, AI Fund, Defy Partners, and Madrona Venture Group, raising $14M across Seed and Series A rounds. The company established itself as a pioneer in the AI observability category, with a distinctive focus on privacy-preserving monitoring — creating statistical profiles of data rather than storing raw inputs.

What is WhyLabs?

WhyLabs sat at the intersection of AI monitoring and data privacy. The core idea: organizations need to monitor their AI systems continuously but often cannot log raw data due to privacy regulations, compliance requirements, or the sheer volume of production data.

The platform solved this through statistical profiling. whylogs, the underlying open-source library, creates lightweight statistical summaries of datasets — capturing distributions, missing values, type information, and drift metrics — without retaining the original data. Teams could detect problems (data drift, quality degradation, model performance drops) while staying privacy-compliant.

With the Apple acquisition, the commercial platform is no longer available, but the technology lives on through three open-source projects.

whylogs (2.7k+ Stars)
Open standard for privacy-preserving data logging. Creates statistical profiles of datasets without storing raw data. Detects data drift, quality issues, and distribution changes while maintaining privacy compliance. Apache 2.0 licensed.
LangKit (960+ Stars)
Open-source LLM monitoring toolkit that extracts signals from prompts and responses. Monitors text quality, relevance, sentiment, toxicity, and safety. Compatible with whylogs for privacy-preserving LLM observability.
WhyLabs Platform (Open-Sourced)
The complete commercial platform codebase has been released as open source, enabling teams to self-host the full observability and monitoring stack that was previously available only as a commercial service.

Key Features

FeatureDetails
Core ApproachPrivacy-preserving monitoring through statistical profiling
Data Loggingwhylogs — lightweight profiles without raw data storage
LLM MonitoringLangKit — prompt/response quality, toxicity, relevance, safety
Drift DetectionStatistical comparison of data distributions over time
Data QualityMissing values, type validation, distribution anomalies
Model HealthPerformance tracking, accuracy monitoring, degradation alerts
PrivacyNo raw data storage — only statistical summaries
StatusAcquired by Apple (January 2025); commercial platform discontinued
Open SourceFull platform, whylogs, and LangKit available on GitHub
LicenseApache 2.0 (whylogs and LangKit)

whylogs: Privacy-Preserving Data Logging

whylogs is the library that made WhyLabs distinctive. It creates mergeable, lightweight statistical profiles of any dataset — whether tabular data feeding a classification model or text data flowing through an LLM.

A whylogs profile captures statistical summaries (mean, median, standard deviation, quantiles), distribution information (histograms, frequent items), data quality metrics (null counts, type mismatches), and cardinality estimates. These profiles can be compared over time to detect drift — when production data starts diverging from training data or from previous time periods. Because profiles are statistical summaries rather than raw data, they can be safely stored, shared, and analyzed without privacy concerns.

LangKit: LLM Monitoring

LangKit extends whylogs specifically for LLM monitoring. It extracts signals from prompts and responses that indicate quality, safety, and relevance:

  • Text quality — Readability scores, fluency metrics, coherence evaluation
  • Relevance — How well responses align with the prompt context
  • Sentiment analysis — Emotional tone and polarity detection
  • Toxicity detection — Harmful, offensive, or inappropriate content flagging
  • Safety assessment — Prompt injection indicators, sensitive content detection

LangKit is designed to work alongside whylogs, meaning all LLM monitoring data gets the same privacy-preserving treatment — statistical profiles rather than stored prompt-response pairs.

The Apple Acquisition

Apple acquired WhyLabs in January 2025 in a quiet deal disclosed through European Commission filings under the Digital Markets Act. WhyLabs co-founder and CEO Alessya Visnjic joined Apple as an engineering leader. The acquisition fit Apple’s expanding AI efforts and its need for internal AI observability.

Following the acquisition, WhyLabs discontinued its commercial platform and open-sourced the complete codebase, ensuring the technology remains available to the community.

Getting Started

1
Install whylogs — Get the data logging library: pip install whylogs. It works with pandas DataFrames, Spark, and raw Python data structures. Apache 2.0 licensed.
2
Profile your data — Create statistical profiles: import whylogs as why; result = why.log(df). Each profile captures distributions, quality metrics, and drift baselines without storing raw data.
3
Add LangKit for LLM monitoring — Install LangKit: pip install langkit. Configure it to extract quality, safety, and relevance signals from LLM prompts and responses.
4
Compare profiles for drift — Compare profiles over time to detect when production data or model behavior diverges from expectations. Use built-in visualization tools or export metrics to your monitoring stack.
5
Self-host the platform (optional) — For the full dashboard experience, deploy the open-sourced WhyLabs platform on your infrastructure. The codebase is available on GitHub under the WhyLabs organization.

When to Use WhyLabs Tools

Although the commercial platform is gone, the open-source components remain valuable for specific use cases. whylogs is particularly useful for teams that need to monitor data quality and drift in privacy-sensitive environments — healthcare, finance, and any domain where logging raw production data creates compliance risks. LangKit is a lightweight, privacy-preserving alternative to commercial LLM monitoring platforms.

The trade-off is that self-hosting requires more operational effort than using a managed platform, and the open-source projects will not receive the same level of commercial development and support they had before the acquisition.

Acquired by Apple
WhyLabs was acquired by Apple in January 2025. The commercial platform has been discontinued, but the full codebase, whylogs, and LangKit have been open-sourced. For actively maintained commercial AI observability, consider Arize AI or Arthur AI.

How WhyLabs Compares

WhyLabs pioneered privacy-preserving AI observability, and its open-source tools remain relevant in the AI security landscape. For actively maintained commercial AI observability, see Arize AI (OpenTelemetry-based tracing, Phoenix open-source, AX enterprise platform) or Arthur AI (model monitoring, bias detection, LLM firewall).

For LLM-specific security rather than observability, consider Lakera Guard or Prompt Security. For runtime guardrails, see NeMo Guardrails or LLM Guard.

For a broader overview of AI security tools, see the AI security tools category page.

Note: Acquired by Apple in January 2025 and subsequently closed. The full platform, whylogs, and LangKit have been open-sourced at github.com/whylabs.

Frequently Asked Questions

What is WhyLabs?
WhyLabs was an AI observability platform that helped organizations monitor model health, detect data drift, and ensure AI system quality. Founded by Alessya Visnjic, the company pioneered privacy-preserving monitoring through statistical profiling. Apple acquired WhyLabs in January 2025, the commercial platform was discontinued, and the codebase was open-sourced.
Is WhyLabs still available?
The commercial WhyLabs platform has been discontinued following Apple’s acquisition in January 2025. However, the full platform source code, whylogs (data logging library), and LangKit (LLM monitoring toolkit) have all been open-sourced. Teams can self-host the platform or use the individual libraries independently.
What is whylogs?
whylogs is an open-source data logging library for machine learning models and data pipelines. It creates statistical profiles of datasets — capturing distributions, missing values, and data types — without storing raw data. This privacy-preserving approach means you can monitor data quality and detect drift without exposing sensitive information. Available on GitHub with 2.7k+ stars.
What is LangKit?
LangKit is an open-source toolkit for monitoring large language models. It extracts signals from prompts and responses including text quality metrics, relevance scores, sentiment analysis, toxicity detection, and safety assessments. LangKit is compatible with whylogs for privacy-preserving LLM monitoring. Available on GitHub with 960+ stars.
How does WhyLabs compare to Arize AI?
Both were AI observability platforms, but WhyLabs was acquired by Apple in January 2025 and is no longer available as a commercial service. Arize AI remains active with its AX enterprise platform and Phoenix open-source tool. WhyLabs differentiated on privacy-preserving monitoring through statistical profiling; Arize emphasizes OpenTelemetry-based tracing and evaluation. For active alternatives, consider Arize AI or Arthur AI.