The analytical rigor that makes a financial expert's conclusions correct is rarely, by itself, what makes them persuasive in a courtroom. Translating terabytes of transaction data, multi-factor regression outputs, and probabilistic inferences into testimony that survives cross-examination—and that a jury or judge can act on—requires a distinct and underappreciated discipline. This article examines the principles and techniques that govern the translation from analysis to evidence.

The Communication Gap in Financial Expert Testimony

Financial economics routinely produces analyses whose validity depends on assumptions that are both defensible and opaque to non-specialists: stationarity of return series, appropriate benchmark construction, selection of event windows, and independence of observations. A failure to communicate these assumptions—and their sensitivity—does not merely weaken testimony aesthetically; it creates genuine vulnerability to Daubert challenges and, more broadly, to the kind of methodological cross-examination that has become standard practice in securities litigation.

Rule 702 of the Federal Rules of Evidence requires that expert testimony be (i) based on sufficient facts or data, (ii) the product of reliable principles and methods, and (iii) the result of a reliable application of those principles and methods to the facts of the case. Courts applying Daubert v. Merrell Dow Pharmaceuticals, Inc., 509 U.S. 579 (1993), have read this to mean that the expert must be prepared not merely to assert conclusions but to explain the methodology in sufficient detail that the factfinder—or a reviewing judge—can assess its soundness. The communication of that methodology is therefore a legal requirement, not merely a pedagogical nicety.

From Dataset to Narrative: Four Structural Principles

01
Lead with the Question, Not the Method

Every piece of quantitative analysis should be introduced by the question it answers, not the technique used to answer it. Judges remember "Did the stock price move abnormally on the date the defendant sold?" more readily than "I ran an OLS regression with heteroskedasticity-corrected standard errors."

02
Quantify Uncertainty Explicitly

Confidence intervals and p-values are not admissions of weakness—they are demonstrations of intellectual honesty. An expert who presents only point estimates and refuses to discuss uncertainty is more vulnerable on cross, not less.

03
Anticipate the Counterfactual

Every damages or profit calculation implies a but-for world. That counterfactual must be explicitly modeled and its assumptions defended. Opposing counsel will construct their own—and the factfinder will choose between two competing narratives.

04
Make the Data Visible

Exhibits that present the raw data alongside the analytical conclusion—scatter plots, annotated time series, transaction timelines—reduce the cognitive burden on the factfinder and make the analytical leap more credible, not less.
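
Principle 02 can be made concrete with a minimal sketch; the point estimate and standard error below are illustrative values, not figures from any actual matter:

```python
# Illustrative only: a one-day abnormal return estimate and its standard
# error, reported with an explicit 95% confidence interval rather than
# as a bare point estimate.
abnormal_ret = -0.042   # -4.2% abnormal return (hypothetical)
se = 0.011              # standard error from market-model residuals (hypothetical)

z = 1.96                # two-sided 95% critical value, normal approximation
ci_low, ci_high = abnormal_ret - z * se, abnormal_ret + z * se

print(f"Abnormal return: {abnormal_ret:.1%} "
      f"(95% CI: {ci_low:.1%} to {ci_high:.1%})")
```

Presenting the interval alongside the point estimate preempts the cross-examination question "Isn't it true your number could be wrong?" with a quantified answer.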

Working with Large Datasets: Visualization for the Courtroom

In complex securities matters, the underlying dataset may span years of trading activity across dozens of securities. Distilling that material into exhibits usable in deposition or at trial requires deliberate choices about aggregation level, time scale, and visual encoding. The goal is not simplicity for its own sake but the preservation of the key inferential content at the resolution appropriate to the factfinder.

The following Python code generates an annotated event-study return chart, the single most common exhibit type in securities fraud matters. The exhibit plots cumulative abnormal returns (CARs) against a timeline of material disclosures, with shaded confidence bands and labeled events—a format that courts have repeatedly found adequate for communicating the statistical analysis underlying a fraud-on-the-market damages theory.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.dates as mdates

def plot_car_exhibit(
    dates: pd.DatetimeIndex,
    car: np.ndarray,
    car_se: np.ndarray,
    events: list[dict],
    title: str = "Cumulative Abnormal Returns",
    output_path: str = "car_exhibit.pdf"
) -> None:
    """
    Generate a publication-quality CAR exhibit for litigation use.

    Parameters
    ----------
    dates      : DatetimeIndex of trading days
    car        : array of cumulative abnormal returns
    car_se     : array of cumulative standard errors (for confidence band)
    events     : list of dicts with keys 'date', 'label', 'type'
                 type: 'corrective' | 'neutral' | 'fraud'
    title      : chart title string
    output_path: file path for PDF export
    """
    fig, ax = plt.subplots(figsize=(12, 5.5))

    # Confidence band (±1.96 SE)
    upper = car + 1.96 * car_se
    lower = car - 1.96 * car_se
    ax.fill_between(dates, lower, upper,
                    alpha=0.15, color='#8B1A1A', label='95% Confidence Band')

    # CAR line
    ax.plot(dates, car, color='#8B1A1A', linewidth=1.8, label='Cumulative Abnormal Return')
    ax.axhline(0, color='#2C2C2C', linewidth=0.7, linestyle='--')

    # Annotate events
    colors = {'corrective': '#8B1A1A', 'neutral': '#5A5A5A', 'fraud': '#2C2C2C'}
    for ev in events:
        ev_date = pd.Timestamp(ev['date'])
        ev_color = colors.get(ev.get('type', 'neutral'), '#5A5A5A')
        ax.axvline(ev_date, color=ev_color, linewidth=1.0,
                   linestyle=':', alpha=0.8)
        # Find closest CAR value for annotation placement
        idx = dates.get_indexer([ev_date], method='nearest')[0]
        y_val = car[idx] if idx >= 0 else 0
        ax.annotate(
            ev['label'],
            xy=(ev_date, y_val),
            xytext=(0, 18),
            textcoords='offset points',
            fontsize=7.5,
            color=ev_color,
            ha='center',
            arrowprops=dict(arrowstyle='->', color=ev_color, lw=0.8),
            bbox=dict(boxstyle='round,pad=0.25', fc='white',
                      ec=ev_color, alpha=0.85, linewidth=0.6)
        )

    # Formatting
    ax.set_title(title, fontsize=13, fontweight='bold',
                 fontfamily='serif', pad=14, color='#2C2C2C')
    ax.set_ylabel('Cumulative Abnormal Return', fontsize=9, color='#5A5A5A')
    ax.yaxis.set_major_formatter(
        plt.FuncFormatter(lambda y, _: f'{y*100:.1f}%'))
    ax.xaxis.set_major_formatter(mdates.DateFormatter('%b %Y'))
    ax.xaxis.set_major_locator(mdates.MonthLocator(interval=2))
    plt.xticks(rotation=30, ha='right', fontsize=8)
    ax.grid(axis='y', alpha=0.25, linewidth=0.5)
    ax.spines[['top', 'right']].set_visible(False)
    ax.legend(fontsize=8, framealpha=0.9)

    plt.tight_layout()
    plt.savefig(output_path, dpi=300, bbox_inches='tight')
    plt.close()
    print(f"Exhibit saved: {output_path}")

Listing 1. Annotated CAR exhibit generator. In production, compute car from actual market-model abnormal returns and propagate the estimation-window variance correctly into car_se per MacKinlay (1997).
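
The variance propagation referenced in the Listing 1 caption can be sketched as follows, under the standard market-model assumptions in MacKinlay (1997); the function name and arguments are illustrative, and the correction terms account for sampling error in the estimated alpha and beta:

```python
import numpy as np

def car_standard_error(resid: np.ndarray,
                       mkt_est: np.ndarray,
                       mkt_evt: np.ndarray) -> float:
    """Standard error of a CAR over the event window, with the
    finite-sample correction for estimation error in the market model.

    resid   : market-model residuals from the estimation window
    mkt_est : market returns over the estimation window
    mkt_evt : market returns over the event window
    """
    m = len(mkt_est)                       # estimation-window length
    sigma2 = float(np.var(resid, ddof=2))  # residual variance (alpha, beta estimated)
    mu_m = mkt_est.mean()
    ss_m = float(((mkt_est - mu_m) ** 2).sum())
    # Per-day abnormal-return variance:
    #   sigma^2 * (1 + 1/M + (R_m,t - mu_m)^2 / SS_m)
    ar_var = sigma2 * (1 + 1 / m + (mkt_evt - mu_m) ** 2 / ss_m)
    # CAR variance sums the per-day variances (serial independence assumed)
    return float(np.sqrt(ar_var.sum()))
```

For long estimation windows the correction terms are small and the simple sigma * sqrt(L) approximation is adequate; for short windows the difference can be material and should be disclosed.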

The Expert Report as Persuasive Architecture

An expert report is a legal document, a scientific disclosure, and a persuasive argument simultaneously. The structure that best serves all three functions follows a consistent logic: (1) a statement of assignment, including the specific questions posed and any limitations on scope; (2) a summary of materials reviewed; (3) the qualifications of the expert; (4) a concise statement of opinions; and (5) the detailed analysis supporting each opinion, in an order that mirrors the factual narrative of the case rather than the chronology of the analyst's work.

Surveying the empirical literature on jury decision-making in complex cases, Koehler and Shaviro (1990) found that jurors respond more favorably to probabilistic evidence when it is presented with a clear baseline comparison than when presented in isolation—a finding that supports the common practice among economic experts of benchmarking abnormal returns against a market-wide or peer-group distribution rather than presenting absolute return figures. More recently, Niedermeier, Kerr, and Messé (1999) documented that jurors with higher numeracy are better able to integrate statistical evidence with narrative testimony, but that even high-numeracy jurors benefit from graphical representations of probabilistic arguments.
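
The benchmarking practice that finding supports can be illustrated with a short sketch; the peer-group returns and the at-issue return below are simulated, hypothetical values:

```python
import numpy as np

# Hypothetical: place an observed one-day return within a peer-group
# distribution rather than presenting it in isolation.
rng = np.random.default_rng(42)
peer_returns = rng.normal(0.0, 0.015, 500)  # simulated peer-day returns
observed = -0.048                           # the at-issue return (hypothetical)

percentile = (peer_returns < observed).mean() * 100
print(f"Observed return of {observed:.1%} is lower than "
      f"{100 - percentile:.1f}% of peer-group observations.")
```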

The single most common error in expert report drafting is leading with the methodology rather than the conclusion. A factfinder's attention is highest at the beginning of a document. Use that attention to establish the analytical bottom line—then deploy the technical apparatus in service of explaining and defending it.

Cross-Examination Resilience

In practice, the most effective cross-examinations of financial experts share a common structure: they attack not the correctness of the arithmetic but the reasonableness of the choices that generated the inputs. Alternative benchmark selection, different event window endpoints, and varying model specifications are all legitimate tools of opposing counsel. An expert who has stress-tested their conclusions against these alternatives—and documented that the core results are robust—is materially more difficult to impeach than one who presents only the baseline analysis.

The R code below demonstrates sensitivity analysis over a range of estimation window lengths for an OLS market model, a standard robustness check that should be reported in any event study expert report.

library(tidyverse)

sensitivity_event_study <- function(
  returns_df,          # data.frame: date, ret, mkt_ret
  event_date,          # Date
  window_lengths = c(60, 90, 120, 180, 252),  # estimation window (days)
  event_window   = c(-1, 1)   # event window around event_date
) {
  event_date <- as.Date(event_date)

  results <- map_dfr(window_lengths, function(n_est) {
    # Identify estimation and event windows
    evt_idx <- which(returns_df$date == event_date)
    if (length(evt_idx) == 0) return(NULL)

    # Use n_est trading days ending immediately before the event window
    est_rows <- (evt_idx - abs(event_window[1]) - n_est):(evt_idx - abs(event_window[1]) - 1)
    evt_rows <- (evt_idx + event_window[1]):(evt_idx + event_window[2])

    if (any(est_rows < 1) || any(evt_rows > nrow(returns_df))) return(NULL)

    est_data <- returns_df[est_rows, ]
    evt_data <- returns_df[evt_rows, ]

    # Estimate market model
    fit <- lm(ret ~ mkt_ret, data = est_data)
    sigma_eps <- summary(fit)$sigma

    # Compute abnormal returns and CAR
    evt_data <- evt_data %>%
      mutate(
        expected_ret = coef(fit)[1] + coef(fit)[2] * mkt_ret,
        abnormal_ret = ret - expected_ret
      )

    car   <- sum(evt_data$abnormal_ret)
    n_evt <- nrow(evt_data)
    car_se <- sigma_eps * sqrt(n_evt)  # large-sample approximation; ignores estimation-window error
    t_stat <- car / car_se

    tibble(
      est_window = n_est,
      car        = car,
      car_se     = car_se,
      t_stat     = t_stat,
      p_value    = 2 * pt(-abs(t_stat), df = n_est - 2),
      significant = p_value < 0.05
    )
  })

  results
}

# Usage: run sensitivity analysis and plot
# results <- sensitivity_event_study(daily_returns, "2023-11-14")
# ggplot(results, aes(est_window, car, ymin=car-1.96*car_se,
#                     ymax=car+1.96*car_se)) +
#   geom_pointrange(color = "#8B1A1A") +
#   geom_hline(yintercept = 0, linetype = "dashed") +
#   scale_y_continuous(labels = scales::percent) +
#   labs(x = "Estimation Window (days)", y = "CAR",
#        title = "Sensitivity of CAR to Estimation Window Choice") +
#   theme_minimal()

Listing 2. Estimation-window sensitivity analysis in R. Robust conclusions should be statistically significant across all or most window specifications. Document the range of results, not just the preferred specification.

Working with Counsel: Practical Protocols

Effective expert testimony is a collaborative product. The analyst brings methodological authority; the attorney brings case theory and evidentiary context. Misalignment between the two is the most common cause of expert reports that are technically sound but strategically ineffective. A forensic analytics engagement should include, at minimum, two substantive pre-report conferences: one to align on the analytical questions to be addressed and the evidentiary constraints under which they will be answered, and a second to review preliminary findings before the report is drafted. The goal of the second conference is not to change the conclusions—which must reflect the analyst's independent judgment—but to ensure that the conclusions are framed in terms that map onto the legal theory of the case.

Mnookin (2008) notes that the adversarial structure of litigation creates persistent pressure on experts to shade their conclusions toward their retaining party's position—pressure that must be actively resisted not merely for ethical reasons but because compromised independence is the single greatest vulnerability in cross-examination. The expert who can demonstrate, through documentation and methodology, that their conclusions would have been identical regardless of who retained them occupies a materially stronger position at trial.

References

  1. Daubert v. Merrell Dow Pharmaceuticals, Inc., 509 U.S. 579 (1993).
  2. Federal Rules of Evidence, Rule 702: Testimony by Expert Witnesses (as amended through 2023).
  3. Koehler, J. J., & Shaviro, D. N. (1990). Veridical verdicts: Increasing verdict accuracy through the use of overtly probabilistic evidence and methods. Cornell Law Review, 75(1), 247–279.
  4. MacKinlay, A. C. (1997). Event studies in economics and finance. Journal of Economic Literature, 35(1), 13–39.
  5. Mnookin, J. L. (2008). Expert evidence, partisanship, and epistemic competence. Brooklyn Law Review, 73(3), 1009–1033.
  6. Niedermeier, K. E., Kerr, N. L., & Messé, L. A. (1999). Jurors' use of naked statistical evidence: Exploring bases and implications of the Wells effect. Journal of Personality and Social Psychology, 76(4), 533–542.
  7. Saks, M. J., & Koehler, J. J. (2005). The coming paradigm shift in forensic identification science. Science, 309(5736), 892–895.
  8. Walters, A. A. (1963). Production and cost functions: An econometric survey. Econometrica, 31(1/2), 1–66. [cited for sensitivity analysis methodology]