2026-05-27agentscode

A Fresh Look at Lamarckian Evolution and the Baldwin Effect

Inès Benito, Johannes F. Lutzeyer, Benjamin Doerr

PDF preview unavailable

Key claim

Baldwinian and Lamarckian evolution outperform Darwinian evolution.

This paper revisits Baldwinian and Lamarckian evolution in evolutionary algorithms, showing they outperform traditional Darwinian methods in various scenarios. The authors provide a set of generalist parameters that can benefit practitioners, highlighting the practical implications of their findings.

In plain English

Novelty

8.0/10

The paper provides a significant new comparison of evolutionary algorithms, extending existing benchmarks and demonstrating superior performance.

Reliability

8.0/10

The empirical results are based on comprehensive experiments across multiple datasets, supporting the claims made.

Deep reliability assessment

The methodology supports the claim that, on GraphBench random-graph MIS and Max-Cut instances under a fixed 40,000-fitness-evaluation budget, Baldwinian/Lamarckian local-search-augmented EAs outperform Darwinian EAs and the listed neural baselines. Broader claims about practical dominance or solver-level performance are less supported because evaluation is limited to two graph optimization tasks, mostly synthetic/random graph families, and comparisons depend strongly on tuning budget, implementation cost of local search, and the chosen specialized solvers.

Reproducibility

Yes: the paper states that complete source code is available at https://github.com/Hypatia-II/polerina, and experiments use the public GraphBench benchmark with described train/test splits, parameter grid search, and fixed evaluation budget.

Discussion questions

1.Does the apparent advantage of Baldwinian evolution come from a genuinely better evolutionary search dynamic, or mainly from giving the algorithm access to a strong local-search oracle during fitness evaluation?
2.For builders solving routing, scheduling, or graph optimization problems, when is a memetic EA preferable to directly running a specialized heuristic or exact solver with the same wall-clock budget?
3.What result would falsify the paper’s main takeaway: poor performance on non-random real-world graph distributions, loss of advantage under equal wall-clock time instead of equal fitness evaluations, or failure to beat well-tuned domain heuristics?

Key figure

No Figure 1 or key architectural diagram is included in the provided excerpt; the central setup is a comparison of Darwinian, Baldwinian, Lamarckian, and mixed L-B evolutionary algorithms with or without local-search-based offspring evaluation/replacement.

Benchmark results

GraphBench ER Smallmean independent set size, higher is better: 33vs best deep learning baseline: GIN, 25.42+7.58 absolute

GraphBench ER Largemean independent set size, higher is better: 42.06vs best deep learning baseline: GIN, 26.28+15.78 absolute

GraphBench BA Smallmean independent set size, higher is better: 143.42vs best deep learning baseline: GIN, 100.16+43.26 absolute

GraphBench BA Largemean independent set size, higher is better: 433.63vs best deep learning baseline: GIN, 135.00+298.63 absolute

GraphBench RB Smallmean independent set size, higher is better: 20.1vs best deep learning baseline: GIN, 17.29+2.81 absolute

GraphBench RB Largemean independent set size, higher is better: 37.61vs best deep learning baseline: GIN, 14.00+23.61 absolute

GraphBench ER Smallmean cut size, higher is better: 2910.4vs best deep learning baseline: GIN, 2327.9+582.5 absolute

GraphBench ER Largemean cut size, higher is better: 23862.1vs best deep learning baseline: GIN, 20878.0+2984.1 absolute

GraphBench BA Smallmean cut size, higher is better: 412.7vs best deep learning baseline: GIN, 397.0+15.7 absolute

GraphBench BA Largemean cut size, higher is better: 1248.1vs best deep learning baseline: GIN, 1044.1+204.0 absolute

GraphBench RB Smallmean cut size, higher is better: 2822.2vs best deep learning baseline: GIN, 2106.7+715.5 absolute

GraphBench RB Largemean cut size, higher is better: 32001.3vs best deep learning baseline: GIN, 24748.0+7253.3 absolute

GitHub1 repo

Hypatia-II/polerinaOfficial