The Art of Writing Efficient Assembly Code: A Balancing Act Between Performance, Size, and Readability

Joao Cardoso

Azure Developer

Published Apr 25, 2023

Assembly language is a low-level programming language that allows developers to directly manipulate a computer's hardware. It is widely used in the development of video games, particularly retro games, where code efficiency is critical for achieving high-performance gameplay on limited hardware.

When programming in assembly, developers need to balance the size of the code and its speed with code readability. In video game development, this trade-off is particularly important because the size of the code can impact the amount of memory available for other game assets, such as graphics and sound.

On the one hand, optimizing for code size can result in faster performance, as smaller code requires fewer instructions to execute. However, this can make the code less readable and harder to maintain. Additionally, smaller code can also be more difficult to debug, as it may require advanced debugging tools to identify and fix issues.

On the other hand, optimizing for code readability can make the code easier to understand and maintain. This is particularly important in game development, where teams of developers may work on the same codebase for years. However, readable code can also be larger and slower, as it may require more instructions to execute.

To balance these trade-offs, developers of retro video games must carefully consider their target hardware and the specific requirements of their game. They need to optimize their code for size and speed, while still ensuring that it remains readable and maintainable.

One common strategy for achieving this balance is to use a combination of assembly language and higher-level programming languages, such as C or C++. Developers can write critical game code in assembly language to achieve optimal performance, while using higher-level languages for less performance-critical tasks. This approach allows developers to optimize their code for size and speed where it matters most, while still maintaining code readability and ease of maintenance.

In conclusion, the trade-off between code size, speed, and readability is a crucial consideration for developers of retro video games using assembly language. By carefully balancing these factors, developers can create efficient, high-performance games that remain readable and maintainable over the long term. Ultimately, the key to success is to understand the specific requirements of the game and the target hardware, and to choose the best approach based on these factors.

Code Goal

The "List_Init" function initializes a double linked list. It takes the list header pointer in register IX and the list data pointer in register HL. The function does modify registers A, BC, DE, and IY.

The double linked list header contains 12 bytes, where the first 10 bytes represent two linked lists, each consisting of 5 bytes. To initialize the linked list, the first 10 bytes of the header must be set to zero, and the 11th and 12th bytes must be set to the number of nodes in the list and the size of each node, respectively.

double linked list header structure with the size of each field — Double linked list header structure

These three code snippets all relate to initializing a double linked list.

#1 Smallest

No alt text provided for this image — Smallest and Slowest Initialization Code

#2 Fastest

#3 Most Readable

The Smallest the Fastest and the Readable

The scale used is a 1-5 scale, where 1 represents the worst and 5 represents the best. The scale was used to categorize the snippets of code based on their size, performance, and readability/maintainability. A score of 1 indicated that the code was poor in that category, while a score of 5 indicated that the code was excellent in that category.

Smallest: In terms of size, the first code snippet is the smallest, using only 16 bytes of memory. However, this comes at the cost of performance since it uses a combination of stack manipulation and slow LDIR instructions to copy data.
Fastest: The second code snippet is the fastest, taking only 148 clock cycles to execute. It uses a combination of direct memory access and the EX instruction to quickly copy data to the desired memory locations.
Most Readable: The third code snippet is the most readable, using clear and concise code with descriptive labels. It is also more flexible since it uses indexed addressing to access the desired memory locations, making it easier to modify and maintain.

Summary

Overall, the choice between these snippets will depend on the specific requirements of the application. If memory usage is a concern, the first snippet may be the best choice. If performance is a priority, the second snippet would be the preferred option. However, if readability and maintainability are important, the third snippet would be the best choice.

The Art of Writing Efficient Assembly Code: A Balancing Act Between Performance, Size, and Readability

Joao Cardoso

Azure Developer

Code Goal

#1 Smallest

#2 Fastest

The Smallest the Fastest and the Readable

Summary

More articles by this author

Insights from the community

Others also viewed

C++ Quest: Mastering C++ OOP Through Game Development

Exploring Smart Pointers in C++

Polymorphism is not just turning someone into a sheep.

Open source game engine - Godot

ALX MAZE PROJECT

Declarative Digital Instrument GUIs with Lua in Audulus - Part 1: Grid Controller

Avoiding Massive View Controllers by refactoring.

C++ Templates: The Cubism of Code

Unreal Engine Programming with Verse

5 Best Programming Languages for Game Development in 2023

Explore topics

Code Goal

#1 Smallest

#2 Fastest

The Smallest the Fastest and the Readable

Summary

Mastering Trigger Management in Assembly: Efficient Solutions for Collision Tracking

Jul 6, 2023

Mastering Double Linked Lists: Efficient Object Removal Techniques

May 30, 2023

Double Linked List Add node

May 16, 2023

Double linked list initialization

May 2, 2023

Introduction to Implementing Double Linked Lists in Assembly Language

Apr 22, 2023

Insights from the community

Others also viewed

C++ Quest: Mastering C++ OOP Through Game Development

Exploring Smart Pointers in C++

Polymorphism is not just turning someone into a sheep.

Open source game engine - Godot

ALX MAZE PROJECT

Declarative Digital Instrument GUIs with Lua in Audulus - Part 1: Grid Controller

Avoiding Massive View Controllers by refactoring.

C++ Templates: The Cubism of Code

Unreal Engine Programming with Verse

5 Best Programming Languages for Game Development in 2023

Explore topics