Add comprehensive x64 code generation for type conversions and aggregate operations by medvednikov · Pull Request #26618 · vlang/v

medvednikov · 2026-02-16T23:53:26Z

Summary

This PR significantly expands x64 code generation capabilities by adding support for type conversion instructions (sign/zero extension, truncation, floating-point conversions), SSE2 floating-point arithmetic, and complex aggregate operations (struct initialization, field extraction/insertion). It also includes infrastructure improvements for handling large aggregates and proper struct field layout calculations.

Key Changes

Type Conversion Instructions

Added support for .sext, .zext, and .trunc operations with proper x64 encoding
- Sign extension uses movsx for 8/16-bit and movsxd for 32-bit values
- Zero extension uses movzx for 8/16-bit and mov eax, eax for 32-bit
- Truncation masks to target width using and with appropriate immediates

Floating-Point Operations

Implemented SSE2-based floating-point arithmetic (.fadd, .fsub, .fmul, .fdiv, .frem)
Added floating-point conversion instructions (.fptosi, .sitofp, .fptoui, .uitofp)
Proper register allocation using xmm0/xmm1 for operands and results
Remainder operation implemented via division, truncation, multiplication, and subtraction

Aggregate Operations

.struct_init: Initialize structs from field values with proper zero-initialization and field offset handling
.extractvalue: Extract fields from tuples/structs with support for:
- Multi-word fields (nested structs)
- Sized loads for sub-8-byte fields to avoid reading adjacent packed fields
- Register-allocated and stack-allocated aggregates
- Large aggregate pointer indirection
.insertvalue: Insert elements into tuples/structs with full aggregate copying and field updates
.inline_string_init: Create string structs with pointer, length, and literal flag fields

Infrastructure Improvements

Added type_align() function for proper struct field alignment calculations
Added struct_field_offset_bytes() for accurate field offset computation
Added large_struct_stack_value_is_pointer() and large_aggregate_stack_value_is_pointer() helpers to distinguish between inline and pointer-based large aggregate storage
Added support for empty function bodies (emit minimal ret stub for functions like __v_init_consts)

New Assembly Helpers (asm.v)

Sized load instructions: movzx for bytes/words, mov for dwords
Generic load/store with displacement: asm_load_reg_base_disp(), asm_store_base_disp_reg()
LEA instruction: asm_lea_reg_rbp_disp()
Shift and bitwise operations: shr, and
Sign/zero extension: movsx, movsxd, movzx
SSE2 instructions: movq, addsd, subsd, mulsd, divsd, roundsd, cvttsd2si, cvtsi2sd

Linker Improvements (ARM64)

Expanded force_external_syms whitelist with comprehensive C library function coverage
Changed symbol resolution strategy to use whitelist-only approach for GOT/stubs
Prevents V-mangled names from leaking into dyld binding

Builder Changes

Modified function building logic to include non-main-module functions in SSA generation

https://claude.ai/code/session_01Ke8CDZM7t1edatQevof46F

…dies Three fixes for the ARM64 backend: 1. gen_func stub: use `true` (N_SECT|N_EXT) for add_symbol so empty function stubs are exported like normal functions, not left as local-only symbols that break dyld binding. 2. build_fn guard: remove `|| b.cur_module != 'main'` so all function bodies are compiled, not just main-module ones. The previous guard caused non-main-module functions (e.g. builtin__new_array_from_c_array) to emit zero-return stubs instead of real code, leading to NULL dereferences at runtime. 3. Expand force_external_syms from 31 to ~140 entries covering all C library functions referenced by V's builtins (memory, string, I/O, filesystem, process, threading, dispatch, backtrace). Change linker to ONLY allow force_external_syms through GOT/stubs, replacing the unreliable '__' name-mangling filter. https://claude.ai/code/session_01Ke8CDZM7t1edatQevof46F

…s and missing operations Add empty function stub handling (emit ret for functions with no blocks), struct/tuple operations (extractvalue, struct_init, insertvalue), integer type conversions (sext, zext, trunc), floating-point arithmetic (fadd/fsub/fmul/fdiv/frem, fptosi, sitofp), inline_string_init, and all necessary helper functions (struct_field_offset_bytes, type_align, large_aggregate_stack_value_is_pointer). Also adds SSE2 and sized-load assembly helpers to asm.v. https://claude.ai/code/session_01Ke8CDZM7t1edatQevof46F

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8438080681

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-02-16T23:59:05Z

vlib/v2/gen/x64/x64.v

+			base_offset := g.stack_map[val_id]
+			struct_offset := base_offset + 8
+
+			// Store str field (offset 0)
+			g.load_val_to_reg(0, str_ptr_id)
+			asm_store_rbp_disp_reg(mut g, struct_offset, rax)


Allocate stack backing before inline_string_init stores

inline_string_init writes a 24-byte string payload at struct_offset := base_offset + 8, but this path assumes g.stack_map[val_id] already points to reserved backing storage. In x64, ordinary instruction values are still given scalar slots, so these stores can overwrite neighboring frame data whenever inline string construction is emitted. Reserve dedicated aggregate storage for this opcode before writing the three fields.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-02-16T23:59:05Z

vlib/v2/gen/x64/x64.v

+		.fptoui, .uitofp {
+			// For now, treat same as signed versions
+			g.load_val_to_reg(0, instr.operands[0])
+			g.store_reg_to_val(0, val_id)
+		}


Perform numeric conversion for fptoui/uitofp ops

This branch currently just loads and stores the operand unchanged, so fptoui and uitofp return raw bit patterns instead of converted numeric values. Any program casting between unsigned integers and floats will produce incorrect results (for example, a f64 -> u64 cast yields IEEE-754 bits, not the integer). These ops need real conversion instructions rather than pass-through copies.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-02-16T23:59:05Z

vlib/v2/gen/x64/x64.v

+						mask := (u32(1) << u32(field_elem_size * 8)) - 1
+						asm_and_rax_imm32(mut g, mask)
+					}


Clear high 32 bits when extracting 4-byte fields

For 4-byte extractvalue fields, the computed mask is 0xFFFFFFFF, then asm_and_rax_imm32 is used. and rax, imm32 sign-extends the immediate, so 0xFFFFFFFF becomes -1 and the operation does not clear upper 32 bits. In register-packed tuple/struct extraction this can leak stale high bits into the result; use a true 32-bit zero-extension path instead.

Useful? React with 👍 / 👎.

claude added 2 commits February 16, 2026 21:25

chatgpt-codex-connector bot reviewed Feb 16, 2026

View reviewed changes

medvednikov force-pushed the master branch 2 times, most recently from cdf4550 to 4dc97d9 Compare March 22, 2026 10:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add comprehensive x64 code generation for type conversions and aggregate operations#26618

Add comprehensive x64 code generation for type conversions and aggregate operations#26618
medvednikov wants to merge 2 commits intomasterfrom
claude/fix-arm64-empty-functions-Vyb7h

medvednikov commented Feb 16, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Feb 16, 2026

Uh oh!

chatgpt-codex-connector bot Feb 16, 2026

Uh oh!

chatgpt-codex-connector bot Feb 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

medvednikov commented Feb 16, 2026

Summary

Key Changes

Type Conversion Instructions

Floating-Point Operations

Aggregate Operations

Infrastructure Improvements

New Assembly Helpers (asm.v)

Linker Improvements (ARM64)

Builder Changes

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants