2 · Standardization Scale

Coefficient of variation as a proxy for institutional scribal control

Paper section: Results §3.1 · Notebooks: 2.1, 2.2, 2.3

Overview

Beyond the directional shift in median h/w ratios, a second signal is encoded in the dispersion of those ratios: how tightly tablets in a period cluster around their period median. Eerkens & Bettinger (2001) proposed that artifact Coefficient of Variation (CV) below ~17% indicates motor-skill limits (standardization by habit), while higher CV indicates less-constrained production. We use CV as a proxy for the degree of institutional control over scribal format.

The Coefficient of Variation

For a period with median h/w ratio \(\tilde{r}\) and standard deviation \(\sigma\):

\[\text{CV} = \frac{\sigma}{\bar{r}} \times 100\%\]

where \(\bar{r}\) is the mean ratio. Lower CV = tighter clustering around the period norm = stronger scribal standardization.

Period standardization ranking

Code
import pandas as pd

df = pd.read_csv("../../paper/figures/period_summary_stats.csv")
df_sorted = df.sort_values('CV').reset_index(drop=True)
df_sorted['Rank'] = range(1, len(df_sorted) + 1)
df_sorted['CV_pct'] = (df_sorted['CV'] * 100).round(1)
df_sorted['Median'] = df_sorted['Median'].round(3)

display_cols = ['Rank', 'Period', 'n', 'Median', 'CV_pct', 'Orientation']
df_display = df_sorted[display_cols].copy()
df_display.columns = ['Rank', 'Period', 'n', 'Median h/w', 'CV (%)', 'Orientation']

highlight = ['Ur III', 'Neo-Babylonian', 'Achaemenid']

def highlight_rows(row):
    if row['Period'] in highlight:
        return ['font-weight: bold; background-color: #fff3e0'] * len(row)
    return [''] * len(row)

df_display.style \
    .apply(highlight_rows, axis=1) \
    .background_gradient(subset=['CV (%)'], cmap='YlOrRd') \
    .format({'n': '{:,}'}) \
    .set_caption("CV standardization ranking — ascending order")
Table 1: Standardization ranking of all periods by Coefficient of Variation (ascending = most standardized). Bold rows are the three benchmark periods discussed in the paper.
(a) CV standardization ranking — ascending order
  Rank Period n Median h/w CV (%) Orientation
0 1 Ur III 22,504 1.125000 25.100000 portrait
1 2 Early Old Babylonian 2,560 1.268000 27.400000 portrait
2 3 Proto-Elamite 1,457 1.333000 27.900000 portrait
3 4 Uruk IV 1,808 1.276000 31.900000 portrait
4 5 Old Akkadian 3,397 1.280000 32.100000 portrait
5 6 Old Babylonian 15,884 1.282000 34.200000 portrait
6 7 ED IIIb 1,895 1.033000 36.300000 portrait
7 8 ED IIIa 414 1.559000 40.600000 portrait
8 9 Hellenistic 453 0.900000 41.200000 landscape
9 10 Middle Babylonian 3,023 1.000000 46.000000 landscape
10 11 Middle Assyrian 1,056 1.110000 50.000000 portrait
11 12 Uruk III 4,985 1.192000 50.200000 portrait
12 13 Hittite 161 1.000000 50.500000 landscape
13 14 ED I-II 457 1.733000 51.900000 portrait
14 15 Old Assyrian 1,124 1.106000 53.600000 portrait
15 16 Neo-Assyrian 3,379 1.158000 53.900000 portrait
16 17 Ebla 3,229 1.026000 56.900000 portrait
17 18 Lagash II 274 1.305000 60.500000 portrait
18 19 Middle Elamite 601 0.863000 74.000000 landscape
19 20 Achaemenid 1,718 0.786000 78.500000 landscape
20 21 Neo-Babylonian 7,133 0.741000 80.700000 landscape

Three benchmark periods

The standardization landscape is anchored by three historically significant periods:

Period Median h/w CV Interpretation
Ur III 1.125 25.1% Tightest portrait norm — bureaucratic mass production
Achaemenid 0.786 78.5% Widest overall, but pixel IQR = 0.150 (tight central tendency)
Neo-Babylonian 0.741 80.7% Loosest dispersion — diverse archive genres

The apparent paradox of Achaemenid high CV alongside tight pixel IQR is resolved by distinguishing two kinds of standardization:

  • Orientation standardization: the central mass of tablets clusters tightly around the landscape norm (IQR = 0.150), suggesting the landscape format was institutionally enforced for canonical documents
  • Global shape standardization: a tail of outlier formats expands the overall CV — likely non-canonical genres (astronomical diaries, school texts, ritual tablets) that did not conform to the administrative format

The standardization curve

Code
import pandas as pd, matplotlib.pyplot as plt, numpy as np

df = pd.read_csv("../../paper/figures/period_summary_stats.csv")

chron_order = ['Uruk IV','Uruk III','Proto-Elamite','ED I-II','ED IIIa','ED IIIb',
               'Ebla','Old Akkadian','Lagash II','Ur III','Early Old Babylonian',
               'Old Babylonian','Old Assyrian','Middle Assyrian','Middle Babylonian',
               'Middle Elamite','Hittite','Neo-Assyrian','Neo-Babylonian',
               'Achaemenid','Hellenistic']

df['_rank'] = df['Period'].map({p: i for i, p in enumerate(chron_order)})
df = df.sort_values('_rank').dropna(subset=['_rank'])

fig, ax = plt.subplots(figsize=(10, 4))
colors = ['#b5622e' if p in ['Ur III', 'Neo-Babylonian', 'Achaemenid'] else '#4a6fa5'
          for p in df['Period']]
ax.bar(range(len(df)), df['CV'] * 100, color=colors, alpha=0.85,
       edgecolor='white', lw=0.5)
ax.set_xticks(range(len(df)))
ax.set_xticklabels(df['Period'], rotation=45, ha='right', fontsize=7.5)
ax.axhline(17, color='grey', ls='--', lw=1.2, alpha=0.7,
           label='Eerkens & Bettinger (2001) motor-skill threshold (17%)')
ax.set_ylabel('CV of h/w ratio (%)', fontsize=9)
ax.set_title('Scribal standardization by period — lower CV = tighter institutional control', fontsize=10)
ax.legend(fontsize=8)
plt.tight_layout()
plt.show()
Figure 1: CV of h/w ratio by period, sorted chronologically. High CV = low scribal control over format; low CV = tight institutional standard.

Eerkens & Bettinger framework

Eerkens & Bettinger (2001) proposed that artifact CV ≤ 17% signals production under cognitive templates — mental representations of a target form that constrain production even without measurement tools. CV 17–57% indicates reduced copying fidelity; above 57%, variation is essentially unconstrained.

Applied to tablet morphology:

  • No period reaches the 17% threshold, confirming that scribal shape was always more variable than true standardized craft production (e.g., mass-produced ceramics)
  • Ur III (25.1%) comes closest — consistent with the interpretation of Ur III as the most bureaucratically controlled period of cuneiform writing
  • Neo-Babylonian (80.7%) and Achaemenid (78.5%) fall in the “unconstrained” range globally, but the Achaemenid pixel IQR signal suggests local standardization within the dominant genre

George’s (2010) concept of “fixed balance” — scribal templates for geometrically harmonious proportions — complements this framework: the portrait norms of the 3rd and early 2nd millennium (h/w ≈ 1.1–1.3) and landscape norms of the 1st millennium (h/w ≈ 0.75–0.85) represent two successive cognitive templates, each maintained with varying institutional force.

Note

Next: Period Classification → — how well machine learning models recover historical periods from silhouette shape alone.