A new pillar on the weekly review

Restaurant operations management has seven named pillars. The eighth was unnamed because it had no primitive.

Every published playbook converges on inventory, labor, food cost, customer experience, compliance, supply chain, marketing. The inbound voice channel is the eighth pillar, and the only reason it has been left off the list for forty years is that nobody could write a number against it. The reference call in this repo is the smallest measurable primitive that closes the gap. 102.36 seconds, two channels, 46 caption segments, all reproducible.

Matthew Diakonov, Written with AI

Published April 26, 20269 min read

Book a 15-minute walkthrough Jump to the primitive

4.9from 200+ restaurants

Eighth pillar named, with a verifiable primitive

Stereo reference call ships in /public/audio

One new row on the existing weekly review

The eighth pillar of restaurant operations management.

One file. Two channels. 46 caption segments. A row on the weekly.

Pillars 1 to 7: every vendor agrees

Pillar 8: voice channel

Primitive: stereo reference call

102.36 seconds, 18 customer + 28 AI

One new row on the weekly review

0:00 / 0:05

Toast pillar listRestaurant365 pillar listNetSuite pillar list7shifts pillar listIndeed ops listMarginEdge pillar listMarketMan pillar listSupy pillar listAll seven, none eight

The seven pillars everyone publishes

Pick any operations management curriculum aimed at restaurant operators and the table of contents converges. Toast publishes one. Restaurant365 publishes one. Indeed publishes one. NetSuite publishes one. Each list reads like a permutation of the same seven categories: inventory, labor, food cost, customer experience, compliance, supply chain, marketing. There is a reason the lists agree, and it is not editorial copy-paste.

Each pillar earned its spot by having a measurement primitive. Inventory has SKU-level counts. Labor has clock punches. Food cost has invoice-line variance. Customer experience has CSAT and review velocity. Compliance has health-code line items. Supply chain has PO-match accuracy. Marketing has campaign attribution. When a vendor builds a category, they build it around the primitive their software can read. No primitive, no pillar.

That is also why the phone has been left out for forty years. It had no primitive. The carrier owned the call detail records and nobody read them. The cashier was sometimes on the phone and sometimes not, and you could not tell from any system the restaurant owned. There was nothing to write a row about.

The eighth pillar, side by side with the seven

Feature	What was on the list before	Pillar primitives
Pillar 1, inventory	SKU-level counts and waste log	Same. Owned by inventory tools.
Pillar 2, labor	Clock punches and schedule cost	Same. Owned by 7shifts, Crunchtime, Toast.
Pillar 3, food cost	Invoice-line variance, theoretical vs actual	Same. Owned by R365, MarginEdge, MarketMan.
Pillar 4, customer experience	CSAT, NPS, review velocity, in-store	Same. Owned by FOH ops and review platforms.
Pillar 5, compliance	Health code, allergen, food safety logs	Same. Owned by HACCP and food safety apps.
Pillar 6, supply chain	PO match, vendor scorecards, lead times	Same. Owned by purchasing platforms.
Pillar 7, marketing	Campaign attribution, new vs return	Same. Owned by CDP and promo tools.
Pillar 8, voice channel	No primitive, no row, footnote under FOH	Stereo recording, 60 Hz envelope, channel-separated transcript, derived metrics line on the weekly.

The primitive: a stereo reference call

A pillar without a primitive is wishful thinking. The eighth pillar needed something a working operations manager could open, listen to, count, and rerun on a sample from their own restaurant. That primitive lives in the repo at /public/audio/dennys-order.mp3 and the parsed result at /src/components/voice-activity-data.ts. The pipeline that built it is /scripts/build-voice-activity-data.py.

0 sReference call duration

0Caption segments

0Customer segments

0AI segments

Every number in that row is reproducible. The recording is a 16-bit stereo WAV with the customer on the left channel and the AI on the right. The Python script reads it with the standard library, splits the channels, runs each one through Deepgram nova-3 multichannel, and emits a 60 Hz RMS envelope per channel plus caption segments grouped on pause and punctuation boundaries. The grouping rules are explicit: max 3.2 seconds per segment, segments split on a pause gap of 0.55 seconds, and 75 characters maximum.

scripts/build-voice-activity-data.py

What the primitive feeds: the data flow

The primitive is not a single number. It is a small graph: stereo audio fans into two transcript channels, two envelopes, and a unified caption stream, then those four artifacts converge into one row on the weekly review. The shape is the contract.

From audio to the weekly review

What the eighth pillar measures, concretely

Calls offered

Total inbound attempts in a window. The phone-equivalent of covers offered. Source of truth is the call log; reference call confirms the parsing pipeline works.

Calls served

Calls that reached either AI completion or successful transfer. The phone-equivalent of covers served. Difference between offered and served is abandonment, and abandonment is revenue.

Mean service time

Average call duration. On the reference call, 102.36 seconds. Pull ten of your own and average. Anywhere from 80 to 140 seconds is normal, depending on menu modifier complexity.

Talk-time ratio AI to customer

On the reference call, 28 AI segments to 18 customer segments. A ratio above 2.0 means the prompts are too verbose. Below 1.0 means the AI is failing to confirm modifiers.

P(wait) at the busy hour

Forward-looking. Erlang C with c=20 and the offered load you read off the busy-hour log. This is the only metric on the weekly that tells you what is about to happen, not what already did.

Pillar 8 row, in one line

calls_offered, calls_served, mean_service_time, talk_time_ratio, P_wait_busy_hour. Five columns. Pasteable into the same review document that already holds the other seven pillars.

Why the talk-time ratio is the new tunable knob

For pillars 1 through 7 the operator already knows the lever. Inventory has par levels. Labor has shift-cost targets. Food cost has portion controls. The eighth pillar inherits one new knob that does not exist anywhere else: the talk-time ratio. It is the ratio of AI envelope energy (or AI caption-segment count, which is faster to compute) to customer envelope energy.

The reference call sits at roughly 1.56 (28 AI segments to 18 customer segments). That is healthy for a first-time caller ordering a multi-modifier item. If a sample of orders drifts above 2.0, the AI is reading too much, the menu prompts are over-verbose, and average call duration is going to climb. If it sinks below 1.0, the AI is skipping modifier confirmation and order accuracy will degrade. Both ends of the range have a concrete fix in the menu configuration.

That tunable did not exist on any of the seven incumbent pillars. It is brand new, owned by the eighth, and it is the kind of knob that operations managers like best: small to compute, slow to drift, easy to act on.

What the weekly review looks like once the eighth row is on it

One block. Eight pillars. The new line is the last one. Same review document, same cadence, same owner, one new exportable row from the analytics dashboard.

ops weekly review, eight pillars

How a real operator reads it

The same recording-plus-transcript pipeline that produced the reference file is what writes the per-call numbers in the production call log. Pull the busy hour, drop the dictionary into a review note, and the review becomes a thirty-second read.

ops_weekly/voice_pillar.py

The five rules an eighth-pillar primitive must satisfy

Rules of a real pillar

Reproducible from a file the operator can open and listen to.
Channel-separated so service time is measurable on the AI side independently.
Sampled at a rate that captures word-level rhythm (60 Hz envelope, here).
Grouped with explicit constants, not vibes (max_dur, pause_gap, max_chars).
Outputs a single line that fits next to the other seven pillars on the weekly.

The path from primitive to pillar to row

Five steps, no new headcount

Open the reference file

Listen to /public/audio/dennys-order.mp3. 102.36 seconds. Customer left, AI right. This is the shape of one served call.

Read the parsed output

Open /src/components/voice-activity-data.ts. 46 caption segments, 18 customer, 28 AI, plus per-channel envelope. This is the shape of the data.

Run the regenerator on a sample of your own calls

Point /scripts/build-voice-activity-data.py at a recording from your own restaurant. Confirm the same shape comes out for your menu.

Aggregate into the five-column row

calls_offered, calls_served, mean_service_time, talk_time_ratio, P_wait_busy_hour. One row per location per week.

Add the row to the existing weekly review

No new dashboard. No new role. The row sits under the seven that already exist. Pillar eight is now on the page.

What the row replaces in the org chart

The eighth pillar does not require a new title on the org chart. It requires one new line on the weekly review owned by whoever already owns FOH service. What it replaces is the dedicated phone employee at $3,000 to $4,000 per month per location, who was a c=1 server with no measurement primitive of their own.

At Mylapore (an 11-location South Indian chain in the Bay Area), the result of putting the eighth pillar on the review was a redeployment of two cashiers from the San Jose location and a projection of $500 in additional revenue per location per day, roughly $2 million a year across the chain.

“We had a Toast dashboard, a 7shifts dashboard, and a MarginEdge dashboard. None of them told us we were missing 30 percent of our calls. The eighth row is the one we were missing.”

Multi-location operator

Bay Area, full-service casual

What changes for the existing seven

Nothing. Pillars 1 through 7 keep their tools, their owners, and their cadence. The eighth pillar is purely additive. It does not displace the POS, the inventory cost system, the labor scheduler, or the food cost variance reporter. It writes the row that nobody had a number for.

That is also the cleanest way to introduce a new pillar. Not by replacing anything. By writing a number where the previous answer was a footnote.

What a busy-hour reading looks like in real numbers

0Calls offered, busy hour

0Calls served

0Mean service time, sec

0%P(wait) at c=20

All four numbers are derivable from the same pipeline that produced the reference call. The first two come from the call log. The third is the average of recorded call durations. The fourth is Erlang C with c=0 and the offered load read off the busy-hour log.

See the eighth pillar on a real restaurant's weekly review.

A 15-minute walkthrough: open the reference call, read the five-column row, see the same numbers from a live operator's busy hour.

Frequently asked questions

Why list seven pillars and call the phone an eighth, instead of just folding it into customer experience or labor?

Because each pillar is defined by its measurement primitive, not by its operational vibe. Inventory has SKU-level counts. Labor has clock punches. Food cost has invoice-line variance. Customer experience has post-meal CSAT and review velocity. The phone has historically had nothing. There was no row in any database the restaurant owned that said a call was offered, served, or abandoned. Folding it into labor erases the only structural fact that matters about it: the phone is a parallel-capacity channel, not a serial labor sub-task. A pillar without a primitive is not a pillar; it is wishful thinking. Once a primitive exists, the eighth pillar is real.

What exactly is the primitive PieLine ships, and how do I verify it for myself?

The repo at github.com/mediar-ai/pieline-phones ships a real call recording at /public/audio/dennys-order.mp3 and the parsed result at /src/components/voice-activity-data.ts. Duration is 102.36 seconds. The Python regenerator at /scripts/build-voice-activity-data.py reads a 16-bit stereo WAV (customer on left channel, AI on right channel), sends it through Deepgram nova-3 with multichannel=true, and writes back a 60 Hz per-channel RMS envelope plus 46 caption segments grouped on pause and punctuation boundaries. The grouping rules are explicit and copyable: max_dur 3.2 seconds, pause_gap 0.55 seconds, max_chars 75 per caption. Of the 46 segments, 18 are the customer side and 28 are the AI side, which gives you a verifiable talk-time ratio for one full order.

How does an operations manager use the primitive on a weekly review?

One row gets added under the labor block. It reads: phone, calls offered, calls served, mean service time, talk-time ratio, P(wait) at the busy hour. All of those numbers are derivable from the same recording-plus-transcript pipeline that produced the reference file. The first three come straight from the call log. Mean service time is the average of recorded call durations. Talk-time ratio is customer envelope energy divided by AI envelope energy. P(wait) at the busy hour is Erlang C with c=20 and the offered load you read off the call log. The line takes thirty seconds to read once it exists.

Is naming this the eighth pillar new, or is everyone secretly already doing it?

Nobody is doing it. Search the published curriculum from NetSuite, Toast, Restaurant365, Indeed, 7shifts, Supy, MarginEdge, MarketMan, and ten others. Every list converges on inventory, labor, food cost, customer experience, compliance, supply chain, marketing. The phone shows up as a footnote inside customer service, not a measurable surface with its own pillar. The reason is honest. None of those vendors have a way to put numbers on the phone. They cover what their software can read. PieLine writes those numbers and exposes them, which is the necessary condition for the pillar to exist in the first place.

What does the talk-time ratio actually tell an operator, in practical language?

It tells you whether the AI is over-confirming or whether the menu is over-complex. On the reference call the AI uses 28 caption segments to the customer's 18, and that is a healthy ratio for a first-time caller ordering a multi-modifier item (lumberjack slam with eggs and bread choice, plus a Coke). If your own talk-time ratio drifts above roughly 2.0 across a sample of orders, that is a signal that menu modifier prompts are too verbose and the AI is reading more than it needs to. If it sits under 1.0 you have an AI that is failing to confirm modifiers, and your order accuracy will be worse than it should be. The ratio is a tunable knob, not just a curiosity.

What is the actual structure of /scripts/build-voice-activity-data.py and why is it in the repo?

It is a 100-ish line Python script with no dependencies beyond the standard library. It opens a stereo WAV with the wave module, reads frames into 16-bit samples, splits left and right channels, computes a per-channel RMS envelope at 60 Hz, then sends the file body to https://api.deepgram.com/v1/listen with model=nova-3 and multichannel=true. It groups the returned word-level timestamps into caption segments using three constants (pause_gap=0.55, max_dur=3.2, max_chars=75) and writes the result to /src/components/voice-activity-data.ts as a typed export. It is in the repo because the whole point of the eighth pillar is that the measurement is reproducible. If you cannot rerun the script and get the same shape of data on your own restaurant's call, the primitive is not real.

How does this change the org chart for a multi-location operator?

Director of Operations and VP of Operations roles already exist for the seven pillars. The eighth pillar does not need a new role; it needs a new line item on the existing role's weekly. That line item is owned by whoever owns the FOH service KPI. The hire it eliminates is the dedicated phone employee at $3,000 to $4,000 per month per location. Mylapore, an 11-location South Indian chain in the Bay Area, redeployed two cashiers from a single San Jose location after going live and is projecting $500 of additional revenue per location per day from eliminating the phone bottleneck. That is roughly $2M in incremental revenue per year across the chain, and it is attributable to one new line on the weekly review.

What if my operations stack is already on Toast, Restaurant365, MarginEdge, or 7shifts?

Keep all of them. Each one owns the pillar it owns. None of them write the row that PieLine writes, because none of them ingest the phone channel. Adding a phone-channel layer is purely additive: it does not displace the POS, the inventory cost system, the labor scheduler, or the food cost variance reporter. It adds the eighth row of the weekly KPI dashboard, the one nobody had a number for. That row is exportable as CSV from the analytics dashboard and pasteable into the same review document that already holds the other seven pillars.

What are the exact numbers from the reference call I can use to anchor my own analysis?

Duration: 102.36 seconds. Caption segments: 46 total. Customer segments: 18. AI segments: 28. Talk-time-segment ratio AI:customer: about 1.56. Sampling rate of the per-channel envelope: 60 Hz. Caption grouping rules: max 3.2 seconds per segment, pause gap of 0.55 seconds breaks segments, max 75 characters per segment. Concurrency ceiling on the default plan: 20 simultaneous calls. Service rate per server at this mean service time: 35.17 calls per hour per server. Hard ceiling per location: 703 calls per hour. None of those numbers were invented for this page; they are derivable from one file you can listen to.

Where do I read the seven pillars side by side with the eighth without sounding like I made it up?

Read the operations management curriculum from any vendor. Toast publishes a pillar list. Restaurant365 publishes a pillar list. NetSuite publishes a pillar list. Indeed publishes one, 7shifts publishes one, Supy publishes one, MarketMan publishes one. Pick any three. Note what is on the list and what is not. The phone is not on any of them as a first-class pillar with its own metrics, only as a sub-bullet under customer service or front-of-house. The eighth pillar framing is a description of what the field already does plus the one thing it has never had the data to do.