Data Catalog Software That Doesn't Need Babysitting to Stay Useful

Auto-discovered. Auto-lineaged. Auto-updated. Stewards govern; the platform maintains.

Most data catalogs die in their second year - when the project sponsor moves teams and nobody updates entries. Logiciel auto-discovers and auto-maintains the catalog, so stewards spend their time on decisions, not data entry.

See Logiciel in Action

Your catalog was great. For 6 months.

What happens to most catalog projects:

Year 1: launched with executive enthusiasm. Year-1 launch enthusiasm followed by year-2 stewardship turnover is the structural failure mode of manual catalogs, not an execution problem.
Year 2: 70% of entries are stale; engineers Slack the steward instead. Stale catalogs train engineers to bypass them entirely, defeating the original investment.
Year 3: re-evaluation cycle. Same vendor, different promises. Three-year revaluation cycles with same-vendor different-promises indicate the underlying problem is structural, not vendor-specific.

If you're shopping data catalog software, you've felt the maintenance gap

Teams here typically need:

Auto-discovery - not manual onboarding. Auto-discovery is the structural fix for catalog decay; manual stewardship workflows can't keep pace with code-change velocity at scale.

Auto-lineage - derived from runtime, not declared. Auto-lineage derived from runtime is the only lineage approach that survives at modern data team velocity.

Active metadata - used, not just searched. Active metadata used by quality, governance, and observability tools is the structural difference between a catalog as wiki and a catalog as control plane.

What you get with Logiciel

A catalog that stays accurate.

Auto-discovery - connects to warehouses, BI, dbt, ML platforms. Auto-discovery across warehouses, BI, dbt, ML platforms eliminates the manual onboarding work that historically derailed catalog projects.
Column-level lineage - auto-derived, always current. Column-level lineage auto-derived from query logs and dbt manifests stays current as code changes - the only approach that survives at scale.
Active metadata - feeds quality, access control, observability. Active metadata feeding quality, access control, and observability turns the catalog from passive documentation into a working part of the data plane.
Steward workflows - curate the 5% that matters; platform handles the rest. Steward workflows for the 5% that matters means humans focus on judgment, not data entry - the only sustainable model.

Where this fits - industries we serve in the US

FinTech & Financial Services

Trading data, risk models, regulatory reporting - sub-second SLAs and audit-ready governance.

PropTech & Real Estate

Listing data, transaction pipelines, geospatial analytics - multi-source consolidation.

Healthcare & Life Sciences

EHR integration, claims pipelines, clinical analytics - HIPAA-aware infrastructure.

B2B SaaS

Product analytics, customer 360, usage-based billing - embedded and operational data.

eCommerce & Marketplaces

Inventory, pricing, order, and customer pipelines - real-time and high-throughput.

Construction & Industrial Tech

IoT, project, and supply-chain data - operational analytics on hybrid stacks.

Engagement models that fit your stage

Dedicated Pod	Staff Augmentation	Project-Based Delivery
Embedded data engineering pod aligned to your sprint cadence - typically 3–6 engineers + a US lead.	Senior data engineers, architects, and SMEs slotted into your team to unblock specific work.	Fixed-scope, milestone-driven engagements with clear deliverables and outcomes.

From first call to first production pipeline

Discover

We map your stack, workloads, team, and constraints in a working session - not an RFP response.

Architect

Reference architecture grounded in your reality, with capacity, cost, and migration plans.

Build

Iterative implementation with weekly demos, code reviews, and your team in the loop.

Operate

Managed operations or knowledge transfer - your choice. Both with US-aligned coverage.

Optimize

Continuous tuning of cost, performance, and reliability against measurable SLAs.

Catalog capabilities

Auto-Cataloging

Connect, discover, classify - no manual onboarding.

Business Glossary

Map business terms to physical data; versioned.

Search & Discovery

Semantic search, recommendation, certified-asset workflows.

Column-Level Lineage

Cross-system, auto-derived.

Tagging & Classification

PII, sensitivity, retention - applied via metadata.

API & Integration

Programmatic access for downstream tools.

Extended FAQs

How is this different from Atlan, Collibra, Alation?

Engineering-friendly authoring (Git-native, API-first, programmatic), faster deployment (24 hours to auto-discovered catalog vs 12-18 months for Collibra), lower TCO at scale, and a strong active-metadata story. Atlan is the closest competitor in modern positioning; Logiciel differentiates on broader scope (catalog plus quality plus pipeline observability plus cost), engineering ergonomics (Terraform, CI/CD, code-reviewed metadata), and integrated governance enforcement. Collibra and Alation are mature stewardship platforms - capable but bureaucratic. Most customers replacing those tools consolidate 2-3 EDM line items into Logiciel with 40-60% TCO reduction. We don't compete on enterprise stewardship workflow depth; for the 95% of US enterprise needs, our trade-offs win.

How does auto-lineage work?

We parse query logs (Snowflake, Databricks, BigQuery, Redshift query history), dbt manifests, Airflow DAGs, Python instrumentation, and BI tool metadata (Looker LookML, Tableau workbooks, Mode notebooks) to derive column-level lineage automatically. No manual declaration required; lineage updates continuously as new query logs arrive (typically within 5-15 minutes). For non-SQL transformations (Python, Spark), SDK instrumentation captures lineage at runtime with 1-3 lines of code per script. Accuracy is typically 95%+ for SQL-heavy stacks. Lineage is queryable via API for downstream tools - impact analysis, audit reports, change management, AI agents needing governed access.

How do business users find data?

Semantic search with recommendations, certified-asset workflows, and a clean UX designed for analysts and stewards rather than engineers. Business users search by business term ('revenue', 'churn rate', 'active customer'), see certified assets ranked first, view lineage as a visual graph, and propose changes through governed workflows. Glossary curation is structured (definition, owner, classification, examples) without being bureaucratic. For self-service analytics teams, Logiciel typically reduces time-to-find-the-right-table from days to minutes - measurable in stakeholder satisfaction surveys and reduced 'where do I find X' Slack messages. AI-powered search recommendations improve over time as usage patterns establish.

Time to value?

Connect your warehouse and BI tool - 24 hours later, you have an auto-discovered catalog of your top 500 assets with column-level lineage. The first week is baseline establishment and tuning; weeks 2-4 are governance configuration (classifications, glossary, ownership) and stakeholder onboarding; by day 30, most teams have a fully populated catalog covering their most-used 1,000-5,000 assets with active stewardship workflows. ROI in the first quarter is typically expressed as analyst time saved (faster discovery), reduced 'where do I find X' overhead, and improved data trust (certified assets ranked first reduces use of stale data). Most customers consolidate 2-3 ad-hoc tools into Logiciel within 6 months.

Can we migrate from our current catalog?

Yes - migration tooling for Atlan, Collibra, Alation, IBM Cloud Pak for Data, Informatica, and other major catalogs. Migration extracts metadata, lineage, glossary, and governance policies from your existing catalog, ingesting into Logiciel while preserving user-facing context. Migration runs in parallel: Logiciel ingests metadata while continuing to honor existing user workflows; over 6-12 months, business users transition to Logiciel's UI and the legacy catalog is retired. Migration is fixed-fee scoped to your catalog footprint. About 60% of customers retire the legacy after 12 months; 40% keep the legacy as a search front-end while Logiciel becomes the metadata of record. Either pattern works.

What about non-SQL transformations?

Python and Spark instrumentation cover those - typically 1-3 lines of code per script to capture lineage at runtime. The SDK wraps DataFrame and Spark operations to track inputs and outputs at the column level. For dbt-Python models, lineage is derived automatically from model definitions. For ad-hoc Python scripts (Pandas transforms, ML feature engineering, custom ETL), the SDK provides drop-in instrumentation. Declarative APIs cover edge cases where instrumentation is impractical (closed-source systems, third-party scripts) - you can declare lineage manually with versioned manifests. We don't require declarative lineage for everything; instrumentation coverage is typically 95%+ for Python-heavy stacks.

Free tier?

Yes - first 500 assets free, forever. The free tier includes auto-discovery, column-level lineage, basic search, business glossary, and read-only API access on those 500 assets. No credit card, no time limit. About 30% of free-tier users convert to paid within 6 months when their asset count outgrows 500 or when they want enterprise features (advanced governance, custom workflows, SSO, audit logging). The other 70% stay free, which is the design intent - making catalog accessible to teams that can't budget enterprise tooling but still need their data discoverable. Free tier is functionally complete for small teams; enterprise tier adds the governance and operations layers.

Get a real catalog in 24 hours

Connect your warehouse and BI tool. In 24 hours we deliver an auto-discovered catalog of your top 500 assets - free, no commitment.

Try Data Catalog Free