A 200-SKU Magento store and a 20,000-SKU Magento store are fundamentally different businesses when it comes to analytics. The small store can be managed by feel — the owner knows which products move and which don't. The large store can't. At scale, catalog and inventory decisions require data, and Magento's built-in tools weren't designed for the kind of analysis large catalogs demand.
This guide covers the specific analytics challenges that come with large Magento catalogs and how to solve them.
Why large catalogs are an analytics problem
Magento handles large catalogs well from a technical standpoint. Its EAV architecture and indexing system can support hundreds of thousands of SKUs. But managing those SKUs effectively — knowing which ones earn money, which ones waste resources, and which ones should be discontinued — requires analytics that Magento doesn't provide natively.
The challenges compound:
Configurable products multiply complexity. A single t-shirt in five colors and six sizes creates 30 simple products under one configurable parent. Your catalog might show 500 products to customers but contain 5,000 SKUs in the database. Analytics need to work at both levels — which configurable products perform well, and which specific size/color combinations actually sell.
Custom attributes create data silos. Magento lets you create unlimited product attributes (brand, material, season, supplier, margin tier). Each attribute adds a dimension that could be useful for analysis. But there's no native way to analyze performance by custom attribute — you can't easily ask "what's my average margin by supplier?" or "which material type has the highest return rate?"
Category hierarchy obscures performance. Products can belong to multiple categories in Magento. A single SKU might sit in "Men's > Shoes > Running" and also in "Sale > Clearance." Revenue gets attributed to categories, but understanding which category structure actually drives discovery and purchase requires deeper analysis than Magento provides.
Catalog analytics: what to measure
Revenue vs. margin by category
Most merchants know which categories generate the most revenue. Far fewer know which categories generate the most profit. These are often different lists.
A common pattern: your highest-revenue category is electronics or branded goods with thin margins, while a lower-revenue category like accessories or own-brand products delivers twice the margin percentage. Without this visibility, you over-invest in promoting the wrong categories.
The analysis you need:
- Revenue by category (Magento provides this)
- Margin by category (Magento does not)
- Revenue-per-SKU by category — are some categories bloated with low-performers?
- Margin contribution — what percentage of total profit does each category deliver?
Attribute-level performance
This is where Magento's flexibility becomes both an asset and a blind spot. If you've set up attributes like brand, supplier, material, or collection, you're sitting on valuable segmentation data. But Magento has no built-in reports that break down sales performance by attribute.
Questions you should be able to answer:
- Which brands have the highest margin? Which are trending up or down?
- Which supplier's products have the lowest return rate?
- Does product material correlate with customer satisfaction or repeat purchase rate?
- Which seasonal collection performed best relative to inventory invested?
Each of these requires joining product attribute data with order data — straightforward conceptually, but complex in practice given Magento's EAV schema.
Configurable vs. simple product performance
Configurable products create a two-tier analytics problem. At the parent level, you want to know: is this product worth carrying? At the variant level, you want to know: which size/color/option combinations actually sell?
A common problem in large catalogs: a configurable product looks healthy in aggregate (decent total sales), but 80% of the revenue comes from two out of fifteen variants. The other thirteen variants sit in inventory, tying up capital and warehouse space. Without variant-level analysis, you keep reordering all fifteen options because the parent product "sells well."
The fix is straightforward analytically — break down sales velocity and margin by individual simple product within each configurable parent — but Magento's native reports don't support this view.
Inventory analytics for Magento
Multi-Source Inventory (MSI)
Magento 2 introduced Multi-Source Inventory, allowing merchants to track stock across multiple warehouses, stores, or dropship sources. MSI tracks:
- Which sources carry which products
- Stock quantity per source per SKU
- Reservation system for orders in process
- Source selection algorithm for order fulfillment
MSI solved a real operational problem: knowing where your stock is. But it doesn't answer the analytical questions that follow:
- Is stock allocated correctly across sources? If 70% of your orders ship from the east coast but 60% of your inventory sits on the west coast, you're paying unnecessary shipping costs.
- Which sources are overstocked? MSI tells you quantities but doesn't flag when a source has six months of supply while another is about to run out.
- What's the carrying cost by source? Warehouse costs vary. Inventory sitting in an expensive urban fulfillment center costs more than the same inventory in a rural warehouse.
Stock status and reorder points
Magento's low stock report tells you when products fall below a threshold. That's useful but reactive — you find out when something is almost out, not when you should have reordered.
Proper inventory analytics calculate:
- Sales velocity — units sold per day/week, with trend direction
- Days of supply — current stock divided by daily velocity. At today's sell rate, how long until stockout?
- Optimal reorder point — based on velocity, lead time from supplier, and desired safety stock
- Seasonal adjustment — a product selling 10 units per day in December but 2 per day in February needs different reorder timing depending on the month
None of this is available in Magento natively. Most merchants either rely on gut feel or build spreadsheets that quickly become outdated.
Identifying dead SKUs and catalog bloat
Every large catalog accumulates dead weight. Products that made sense when listed but no longer sell. Variations that never found an audience. Seasonal items that didn't get delisted. Over time, this bloat creates real costs:
- Carrying costs — inventory sitting in warehouses costs money every day
- Catalog noise — more SKUs means more pages to maintain, more images to host, and a harder time for customers to find what they actually want
- Marketing dilution — if you're running product feeds for ads, dead SKUs waste ad spend or dilute feed quality
- Operational overhead — more SKUs means more purchase orders, more receiving, more warehouse locations
How to identify dead stock
A dead SKU analysis needs multiple data points:
- Zero sales in 90+ days — the obvious signal. But check for seasonality first; a winter coat with no summer sales isn't dead.
- Declining velocity — products trending toward zero are dead stock in waiting. Catching them early gives you time to discount or liquidate before carrying costs eat the remaining margin.
- Low margin + low volume — even if a product still sells occasionally, if it's low margin and low volume, the operational cost of maintaining it may exceed the profit it generates.
- High return rate — a product that sells but comes back frequently is worse than a product that doesn't sell at all. Returns have hard costs (processing, restocking, potential damage) and soft costs (customer frustration).
The action plan for dead stock is a decision tree:
- Can it be discounted and sold through? Mark it down and move it.
- Can it be bundled with faster-moving products? Use it as a value-add.
- Should it be liquidated? Sell it to a liquidator at deep discount.
- Should it simply be discontinued? Remove it from the catalog, write off the remaining inventory, and free up the resources it consumed.
Catalog health metrics
Beyond individual dead SKUs, track these catalog-level metrics:
- Active SKU ratio — what percentage of your catalog generated at least one sale in the last 90 days? Healthy catalogs typically see 70-80%. Below 50% means serious bloat.
- Revenue concentration — what percentage of revenue comes from your top 20% of SKUs? High concentration (80/20 or worse) suggests a long tail of underperformers.
- Average revenue per SKU — total revenue divided by active SKU count. Track this over time; if you're adding SKUs faster than you're growing revenue, each product is getting less attention and generating less return.
How Spark handles Magento's catalog complexity
Spark by MishiPay connects to Magento through the REST API and handles the data model complexity that makes catalog and inventory analytics difficult to build manually.
When you connect a Magento store, Spark understands the relationships between configurable products and their simple product variants, between products and their custom attributes, and between inventory sources and stock allocations. It processes this through its analytical framework to give you answers without requiring you to understand Magento's internal schema.
Practical examples of what this looks like:
"Which product categories have the best margin but are understocked?"
Spark combines margin analysis with inventory velocity data to identify categories where you're leaving money on the table due to stockouts.
"Show me all SKUs with zero sales in the last 90 days that still have inventory."
A dead stock report that would take hours to build in a spreadsheet, delivered in seconds.
"Break down my top 10 configurable products by variant performance."
See exactly which size/color/option combinations are carrying each product — and which are dragging it down.
"What's my active SKU ratio and how has it changed over the last four quarters?"
Track catalog health over time to see if your catalog is getting leaner or more bloated.
For Magento merchants with large catalogs, the value proposition is direct: the data you need for better catalog and inventory decisions already exists in your store. The challenge is extracting it from Magento's complex data model and turning it into actionable insights. That's what Spark does.
Next steps for catalog optimization
If you're managing a large Magento catalog, start here:
- Run a dead stock audit. Identify every SKU with zero sales in 90+ days. Decide on an action for each one: discount, bundle, liquidate, or discontinue.
- Measure your active SKU ratio. If it's below 60%, your catalog needs pruning.
- Analyze margin by category, not just revenue. You may discover that your promotional focus is on the wrong categories.
- Break down configurable products. Stop reordering variants that don't sell just because the parent product does.
- Review MSI allocation. If you use multiple sources, check whether inventory is positioned where demand actually is.
Large catalogs are a competitive advantage when they're managed well. When they're not, they're an expensive liability. The difference is analytics.
Get clarity on your Magento catalog
Connect your store and let Spark identify dead SKUs, analyze category margins, and optimize inventory — no SQL required.