(Rank 3/3395) The hardest optimization competition ever on Kaggle!
by Kha Vo
I’ve just taken a very high place (3/3395), equivalent to a gold medal (my 10th), in the Santa 2025 - Christmas Tree Packing Challenge (yes, you heard that right: it’s another Santa!).
In this contest we were tasked with a traditional (but really hard) 2D bin-packing optimization problem. The objective is to pack n Christmas trees (of known, fixed shape) into a square box of minimal side length, for every n from 1 to 200.
Sounds simple, right? It is anything but.
If you get the chance to go through our code, you’ll realize the hard truth: it consumed an enormous amount of energy over two months, from me and from all my teammates.
Competing on Kaggle is never easy, and never will be.
The top 10 is always occupied by the best of the best in the world. All the top heuristics competitors from atcoder.jp joined this contest (saharan, c-number, nagiss, bowwow, montplusa…), and most of the Kaggle veterans of the Santa series were present as well (Lucien, Rafbill, solverworld…).
We had a really great time together, both as teammates and as opponents. Kaggle Santa is always fun and energetic (a sharp contrast with the 95% of other contests that are public/private LB competitions).
In the end, Jeroen Cottaar, a rising star, took 1st place impressively, which also made him a Kaggle Competitions Grandmaster and World Number 2 in the global ranking. Around 7 days before the deadline, my team moved into 1st place with real hope of winning outright. However, Jeroen proved too strong to resist, and the 2nd team (Kirare) found what they called a “bullet” idea that jumped them up to 2nd place, despite a large gap to us.
After all the dust settled, this is an excellent result for me, and I’m very happy with it. Personally, I pushed myself so hard that even now, 5 days after the deadline, I still feel the exhaustion of those two months of tremendous work.
Below is the write-up from the whole team (with some minor modifications from me to better suit the audience here). Enjoy!
Introduction
A schematic overview of our solution can be found below. It consists of the following main components, which we will discuss more in-depth subsequently:
- Rectangle combination to reach good square solutions for large N from smaller rectangular solutions
- A heavily modified version of Sparrow that supports warm starts, directly optimizes the square objective, increases numerical precision, and introduces several new hyperparameters to control optimization behavior (Rust)
- A very fast C++ simulated annealing (SA) and “push” procedure to slightly perturb and compress square solutions (final step for each N)
- A seamless workflow to fully exploit teamwork and distributed compute resources
Rectangle combination
Early in the competition, we achieved strong results by combining optimized rectangles. We generated a large library of rectangle bands with diverse aspect ratios using lattice generation and the vanilla Sparrow algorithm with different strip heights.
Rectangles from this library were then recursively combined to form larger rectangles and squares. While none of our final results were generated directly by this algorithm, these solutions served as valuable seeds for subsequent steps.
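To make the combination idea concrete, here is a minimal hypothetical Python sketch that treats each rectangle as just a (width, height) pair; the real combiner also tracks the trees inside each rectangle and their packing density, which this toy ignores.

```python
from itertools import combinations

def combine(r1, r2):
    """Combine two rectangles (w, h) side by side and stacked.

    Hypothetical simplification: real combination also merges the tree
    placements contained in each rectangle, not just the dimensions.
    """
    w1, h1 = r1
    w2, h2 = r2
    side_by_side = (w1 + w2, max(h1, h2))
    stacked = (max(w1, w2), h1 + h2)
    return [side_by_side, stacked]

def squareness(rect):
    """Side of the bounding square; lower is better for the competition metric."""
    w, h = rect
    return max(w, h)

# Tiny illustrative library of rectangle bands with diverse aspect ratios.
library = [(1.0, 3.0), (2.0, 1.5), (3.0, 1.0), (1.5, 2.0)]

# One round of recursive combination: try all pairs and keep the most
# square-like result as a candidate for the next round.
candidates = []
for r1, r2 in combinations(library, 2):
    candidates.extend(combine(r1, r2))
best = min(candidates, key=squareness)
print(best)
```

Running further rounds over an ever-growing library is what eventually yields near-square composites for large N.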
Below is an example output of the rectangle combination algorithm (with some post-processing already applied). Note that the rectangles in the green and blue zones are, in turn, combinations of other rectangles.
As a funny coincidence, when we plotted the minimum and maximum dimensions of our generated rectangles, birds (sparrows?) seemed to appear! What a phenomenon!
Applying this algorithm alone achieves a score of roughly 70.0.
Simulated annealing + Compression
Early on, we translated one of the SA notebooks made publicly available in the first days of the competition into C++ and added multi-threading. Oleh, who only merged with us in January 2026, was using another public SA notebook translated to C++.
An outline of the SA algorithm is provided in pseudo-code below:
Initialize current layout and best layout
Compute initial bounding square size
Set temperature T = T_start
For many iterations:
    Pick one random tree
    Propose a small random move (dx, dy) and rotation (dθ)
    If the moved tree overlaps others: reject
    Compute new bounding square size (objective)
    Δ = new_score − current_score
    If Δ < 0:
        accept move
    Else:
        accept move with probability exp(−Δ / T)
    If accepted:
        update current layout
        update best layout if improved
    Gradually decrease T
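The loop above can be sketched as a self-contained Python toy, packing unit disks instead of tree polygons (the polygon collision test is the only conceptually different part); all parameter values here are illustrative, not the ones we used.

```python
import math
import random

def bounding_side(centers, r):
    # Side of the smallest axis-aligned square containing all disks of radius r.
    xs = [c[0] for c in centers]
    ys = [c[1] for c in centers]
    return max(max(xs) - min(xs), max(ys) - min(ys)) + 2 * r

def overlaps(centers, i, r):
    # True if disk i intersects any other disk.
    x, y = centers[i]
    return any(j != i and (x - ox) ** 2 + (y - oy) ** 2 < (2 * r) ** 2
               for j, (ox, oy) in enumerate(centers))

def anneal(centers, r, iters=20000, t_start=1.0, t_end=1e-3, seed=0):
    rng = random.Random(seed)
    cur = [list(c) for c in centers]
    cur_score = bounding_side(cur, r)
    best, best_score = [c[:] for c in cur], cur_score
    for it in range(iters):
        t = t_start * (t_end / t_start) ** (it / iters)  # geometric cooling
        i = rng.randrange(len(cur))
        old = cur[i][:]
        cur[i][0] += rng.gauss(0, t)   # move scale shrinks with temperature
        cur[i][1] += rng.gauss(0, t)
        if overlaps(cur, i, r):
            cur[i] = old               # overlapping moves are rejected outright
            continue
        new_score = bounding_side(cur, r)
        delta = new_score - cur_score
        if delta < 0 or rng.random() < math.exp(-delta / t):
            cur_score = new_score
            if cur_score < best_score:
                best, best_score = [c[:] for c in cur], cur_score
        else:
            cur[i] = old
    return best, best_score

# Four unit disks on a loose grid; the optimal bounding square has side 4.
start = [[0.0, 0.0], [3.0, 0.0], [0.0, 3.0], [3.0, 3.0]]
_, score = anneal(start, r=1.0)
print(round(score, 2))
```

The C++ version additionally proposes rotations and runs many such chains in parallel threads.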
In addition, we implemented a simple but initially effective post-processing algorithm that simulates shaking a tray containing the polygons. This was also implemented in C++.
An outline of this pushing algorithm is provided in pseudo-code below:
Repeat until no improvement:
    For each predefined push direction (e.g. left+down, right+up):
        For decreasing step sizes:
            For each tree:
                While tree can move slightly in push direction (and rotate a bit)
                without collisions or boundary violations:
                    accept the move
    If resulting layout has smaller bounding square:
        accept it
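The push procedure can be sketched in Python as follows, again with unit disks standing in for trees, a single left+down direction, and no rotation; the bounding box of the starting layout plays the role of the tray walls.

```python
def bounding_side(centers, r):
    # Side of the smallest axis-aligned square containing all disks of radius r.
    xs = [c[0] for c in centers]
    ys = [c[1] for c in centers]
    return max(max(xs) - min(xs), max(ys) - min(ys)) + 2 * r

def push(centers, r, directions=((-1.0, -1.0),), steps=(0.5, 0.1, 0.02)):
    """Shake-the-tray compression: nudge each disk along a push direction
    while it stays collision-free, with shrinking step sizes. Mutates
    centers in place."""
    # Container walls: the bounding box of the starting layout.
    lo_x = min(c[0] for c in centers); hi_x = max(c[0] for c in centers)
    lo_y = min(c[1] for c in centers); hi_y = max(c[1] for c in centers)

    def ok(i):
        x, y = centers[i]
        if not (lo_x <= x <= hi_x and lo_y <= y <= hi_y):
            return False  # boundary violation
        return all(j == i or
                   (x - centers[j][0]) ** 2 + (y - centers[j][1]) ** 2 >= (2 * r) ** 2
                   for j in range(len(centers)))

    improved = True
    while improved:
        before = bounding_side(centers, r)
        for dx, dy in directions:
            for step in steps:
                for i in range(len(centers)):
                    # Keep nudging disk i along the push direction while legal.
                    while True:
                        centers[i][0] += dx * step
                        centers[i][1] += dy * step
                        if not ok(i):
                            centers[i][0] -= dx * step
                            centers[i][1] -= dy * step
                            break
        improved = bounding_side(centers, r) < before - 1e-9

layout = [[0.0, 0.0], [3.0, 0.0], [0.0, 3.0], [3.3, 3.3]]
push(layout, r=1.0)
print(round(bounding_side(layout, 1.0), 2))
```

Here the loose top-right disk is pulled inward until it rests against its neighbors, shrinking the bounding square.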
By combining the rectangle combiner with these algorithms (to post-process both the rectangles in our library as well as the intermediate & final solutions of the combiner), a score of roughly 69.50 can be achieved.
Extending the Sparrow algorithm
While we started using the “vanilla” implementation of the Sparrow algorithm (credit to @jern97 and @tonywauters) quite early on to create rectangles for our rectangle library, we had never fully explored it in depth. When Oleh joined our team in January, he mostly worked on extending that implementation with several inductive biases tailored to our specific problem.
Directly optimizing the square: Sparrow’s objective was modified to minimize both width and height simultaneously, instead of minimizing width while keeping height fixed. Aligning Sparrow’s objective with the competition metric was a natural first extension.
Warm start: We extended Sparrow to accept an initial solution, allowing it to warm-start from previously obtained layouts. This enabled us to seed Sparrow with the diverse solutions produced by the rectangle combination algorithm.
Enforce symmetry in the solver (4-way and 2-way symmetry): To significantly reduce the search space, we enforced symmetry constraints. Both 2-way and 4-way symmetry were supported for values of N divisible by 2 and 4, respectively. While few symmetries remain in the final solutions, this approach helped Sparrow converge to strong seed solutions early in optimization.
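For intuition, here is a hypothetical Python sketch of how a 4-way symmetry constraint can be realized: only one quadrant is optimized, and the other three copies are derived from it. The angle mapping assumes the tree polygon is mirror-symmetric about its own axis, so a mirrored copy is again a rotation of the same shape; this is an illustration of the constraint, not Sparrow's internals.

```python
def four_way(placements, cx, cy):
    """Expand one quadrant into a 4-way symmetric layout around (cx, cy).

    Each placement is (x, y, theta_degrees). For a shape that is
    mirror-symmetric about its vertical axis, mirroring in x maps angle
    theta to -theta, mirroring in y maps it to 180 - theta, and a point
    reflection maps it to 180 + theta.
    """
    full = []
    for x, y, th in placements:
        dx, dy = x - cx, y - cy
        full.append((cx + dx, cy + dy, th))          # original
        full.append((cx - dx, cy + dy, -th))         # mirror in x
        full.append((cx + dx, cy - dy, 180.0 - th))  # mirror in y
        full.append((cx - dx, cy - dy, 180.0 + th))  # point reflection
    return full

# One tree in the optimized quadrant expands to four in the full layout.
quadrant = [(1.0, 2.0, 30.0)]
layout = four_way(quadrant, cx=0.0, cy=0.0)
print(len(layout))
```

The solver then only needs to check collisions once per symmetry class, which is what shrinks the search space so dramatically.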
Orientation biases to enforce lattice-like structures: To reduce rotational noise and stabilize global structure, we apply a soft orientation bias that encourages items to align with a dominant layout axis. For a placement with rotation θ, the penalty is P(θ) = w · B(c) · Δ(θ, θ*), where w is orientation_bias_weight (hyperparameter), θ* is a fixed target orientation_bias_angle (hyperparameter), and Δ is the smallest angular difference modulo 180°.
The penalty is made center-aware by setting B(c) = 1 + b · (1 − r)^p, where r is the normalized distance to the container center, b = orientation_bias_center_boost (hyperparameter), and p = orientation_bias_center_power (hyperparameter), enforcing stronger alignment in the center and softer constraints near the edges. The bias strength adapts during optimization (growing on improvement, decaying otherwise, and rescaled at the start of compression).
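A small Python sketch of this bias follows. The linear dependence on the angular deviation is assumed for illustration (the write-up's exact functional form is not reproduced here), and all parameter values are hypothetical.

```python
import math

def angular_delta(theta, theta_star):
    """Smallest angular difference between two orientations, modulo 180 degrees."""
    d = abs(theta - theta_star) % 180.0
    return min(d, 180.0 - d)

def orientation_penalty(theta, x, y, cx, cy, half_side,
                        weight=0.1, theta_star=22.0, boost=2.0, power=2.0):
    # Normalized distance from the item to the container center (0 = center, 1 = edge).
    r = min(1.0, math.hypot(x - cx, y - cy) / half_side)
    # Center boost: strongest alignment pressure at the center, none extra at the edge.
    center_boost = 1.0 + boost * (1.0 - r) ** power
    # Assumed form: penalty grows linearly with the angular deviation.
    return weight * center_boost * angular_delta(theta, theta_star)

# The same 10-degree deviation is penalized more at the center than at the edge.
print(orientation_penalty(32.0, 0.0, 0.0, 0.0, 0.0, 10.0),
      orientation_penalty(32.0, 10.0, 0.0, 0.0, 0.0, 10.0))
```

Note that angular_delta treats orientations 180° apart as identical, which is consistent with a lattice of trees at both 22-24° and 202-204°.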
Generating and kicking seed solutions: In addition to the seeds generated by our rectangle combination algorithm, we also implemented down- and up-sampling routines to generate initial Sparrow solutions from solutions at other values of N. Below is a simple example of how to upsample: we add a tree at random and let Sparrow insert it intelligently.
Much of the effort in the later stages of the competition was spent tuning these hyperparameters and carefully selecting both the target N and the N values to down- or up-sample from, often guided by visual inspection.
Before running Sparrow, initial solutions were “kicked” to help escape local optima by applying global rotations (rotating the entire solution by a few degrees) and scaling (multiplying all (x, y) coordinates by a factor slightly larger than 1).
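A minimal Python sketch of such a kick, applied here to point placements (for real tree placements, each tree's own rotation would be shifted by the same angle); the angle and scale values are illustrative.

```python
import math

def kick(centers, angle_deg=2.0, scale=1.02):
    """Kick a layout before re-optimization: rotate everything a few degrees
    about the centroid and inflate all coordinates slightly, giving the
    solver room to escape the current local optimum."""
    a = math.radians(angle_deg)
    cx = sum(x for x, _ in centers) / len(centers)
    cy = sum(y for _, y in centers) / len(centers)
    kicked = []
    for x, y in centers:
        dx, dy = x - cx, y - cy
        rx = dx * math.cos(a) - dy * math.sin(a)   # global rotation
        ry = dx * math.sin(a) + dy * math.cos(a)
        kicked.append((cx + rx * scale, cy + ry * scale))  # slight inflation
    return kicked

layout = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
print(kick(layout)[0])
```

Because both operations preserve the relative structure (rotation exactly, scaling up to the inflation factor), the kicked layout remains collision-free and a good starting point for the next Sparrow run.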
Teamwork
Ideation and balanced workload distribution are major challenges in Kaggle team settings, especially in a competition that demands relentless optimization to fully exploit available compute resources. Taking into account different backgrounds and time zones, we adopted a simple and efficient collaboration scheme:
- Kaggle notebooks (with internet enabled), automatically submitting each newly found improvement for a single N as one submission (thanks to Kaggle for allowing 100 submissions per day)
- Reporting scores and parameters directly to Slack from those notebooks for further analysis and discussion
Below is an example of the messages posted by our Slack bot when improvements were found:
With this workflow, all team members could continuously run multiple notebooks 24/7 with minimal effort, alongside their local machines. Merging improvements and distributing new ideas was fast and straightforward.
Interesting findings
The median angles of the solutions show a clear pattern: optimal solutions tend to have a lattice structure of trees at around 22-24 degrees and 202-204 degrees.
LLMs: It would be dishonest not to mention the use of AI tools in this competition. While all core ideas originated from the team (otherwise a gold medal would not have been possible), AI tools proved valuable for supporting tasks. In particular, they helped translate Python prototypes into efficient C++ implementations and assisted in modifying the Sparrow Rust codebase (a language unfamiliar to most of our team), making the extensions described above significantly easier to implement.
Code
My code is shared here (also embedded below for the viewer’s convenience). Enjoy!