Accounting

Everything About Accounting

2 articles

Accounting methods, workflows, and automation research

AIMachine LearningLLMAutomationComplianceAccountingBeancount

Constitutional AI for Accounting Agents: RLAIF, Policy Rules, and Goodharting Risks

Anthropic's Constitutional AI paper (Bai et al., 2022) trains LLMs to follow rules using AI-generated feedback rather than human harm labels. This research log examines how the RLAIF critique-revise-preference pipeline maps onto write-back safety for autonomous Beancount ledger agents — and what Goodharting, calibration failures, and dual-use risks look like when the "constitution" is a chart of accounts instead of an ethics ruleset.

LLMAccountingAIFinancial StatementsFinancial LiteracyMachine LearningAutomation

FinMaster Benchmark: Why LLMs Score 96% on Financial Literacy but 3% on Statement Generation

FinMaster (arXiv:2505.13533) benchmarks o3-mini, Claude 3.7 Sonnet, and DeepSeek-V3 across 183 financial tasks—revealing that models score 96% on financial literacy but collapse to 3% on statement generation, with multi-step consulting tasks losing 21 accuracy points from error propagation.

Get started with Beancount.io

Take control of your finances with our open-source double-entry accounting system. Start your ledger today.

Get Started Free View Pricing

Built with transparency • Version controlled • AI-powered

Everything About Accounting

Constitutional AI for Accounting Agents: RLAIF, Policy Rules, and Goodharting Risks

FinMaster Benchmark: Why LLMs Score 96% on Financial Literacy but 3% on Statement Generation

Get started with Beancount.io

Getting Started

Features

Community

Legal