Fact-checked by Grok 2 weeks ago

Cython

Cython is an optimizing static compiler for both the Python programming language and the extended Cython programming language, which is a superset of Python designed to facilitate the creation of high-performance C extensions for Python.^[1] It originated as an evolution of Pyrex, a language created by Greg Ewing in 2002 to simplify writing Python extension modules in C, and the Cython project itself was initiated in 2007 by a group of developers including Robert Bradshaw and William Stein, who merged enhancements from various projects such as SageMath.^[2]^[1] The primary purpose of Cython is to bridge the gap between Python's ease of use and C's execution speed, enabling developers to wrap external C libraries, embed the CPython interpreter, and accelerate computationally intensive Python code by adding static type declarations and direct C/C++ interop.^[1] Key features include support for Python 3.8 and later (with compatibility for Python 2.7 and 3.5+ in earlier versions), generation of efficient C code that compiles to extension modules, and experimental support for CPython's Limited API and free-threading in version 3.1 and beyond.^[1] It also extends Python syntax with C-style declarations, allowing for fine-grained control over memory management and performance optimizations without sacrificing Pythonic readability.^[1] Cython has become a cornerstone in the scientific computing and machine learning ecosystems, powering libraries such as SciPy for numerical computations, scikit-learn for machine learning algorithms, and spaCy for natural language processing, where it delivers significant speedups—often orders of magnitude faster than pure Python implementations.^[1] Its active development, led by core contributors like Stefan Behnel and Robert Bradshaw, continues to enhance compatibility with modern Python features and alternative interpreters like PyPy.^[2]

Fundamentals

Definition and Purpose

Cython is an optimizing static compiler for both the Python programming language and the extended Cython programming language, which generates C code that can be compiled into efficient extension modules for the CPython interpreter.^[1] As a superset of Python, it allows developers to write code that is largely compatible with standard Python syntax while incorporating C-like features for performance enhancement.^[3] The primary purpose of Cython is to enable Python programmers to achieve near-C-level performance without fully abandoning Python's productivity advantages, by facilitating optional static typing, direct interoperation with C and C++ code, and the creation of high-speed extensions.^[1] This addresses key limitations of pure Python, such as interpreter overhead from dynamic typing and function calls, allowing for substantial speedups— for instance, adding static types can yield orders-of-magnitude improvements over interpreted Python in compute-intensive loops.^[4] Compared to writing extension modules directly in C, Cython simplifies the process by reducing boilerplate and error-prone manual memory management. Key benefits include seamless wrapping of existing C libraries for use in Python applications, embedding of the CPython interpreter into C programs, and overall acceleration of numerical or data-processing tasks common in scientific computing.^[1] Cython supports CPython 3.8 and later versions as its primary runtime, with experimental compatibility for alternatives like PyPy.^[5] Originating as a successor to the Pyrex project, it builds on that foundation to provide more robust Python integration.^[1]

Relationship to Python and Pyrex

Cython is a superset of the Python 3 programming language, meaning that all valid Python 3 code is also valid Cython code and can be executed unchanged within a Cython context.^[1]^[6] This design ensures seamless integration with existing Python codebases while introducing optional C-inspired extensions, such as static type declarations, to enable performance optimizations through compilation to C. Cython originated as a fork of Pyrex, a language developed by Greg Ewing and first released in April 2002, which aimed to simplify the creation of Python extension modules in C.^[7]^[2] While Pyrex provided a Python-like syntax for generating C code to interface with the Python interpreter, it had limitations, including incomplete support for C++ and no native compatibility with Python 3.^[8] Cython evolved from Pyrex to address these shortcomings, enhancing features like C++ integration and full Python 3 support, thereby positioning itself as a more robust tool for high-performance extensions without sacrificing Python's core usability.^[7] In terms of compatibility, Cython source files, typically with the .pyx extension, can directly import and extend standard Python modules, allowing developers to mix dynamic Python features with compiled C elements in the same project.^[9] Upon compilation, Cython generates C code that leverages the Python/C API to embed and interact with the Python interpreter runtime, preserving access to Python's dynamic behaviors such as object-oriented programming and exception handling even in performance-critical sections.^[10] This hybrid approach ensures that Cython modules function as drop-in replacements for pure Python code within larger applications. Unlike standard Python, which relies solely on interpretation, Cython introduces compiler directives to fine-tune behavior, such as # cython: language_level=3 to explicitly set compatibility with Python 3 semantics and avoid ambiguities across versions.^[11] These directives, which can also be specified via command-line options or build configurations, enable precise control over aspects like bounds checking and type inference, further bridging the gap between Python's flexibility and C's efficiency.^[9]

Historical Development

Origins and Early Influences

Cython traces its roots to Pyrex, a programming language developed by Greg Ewing in 2002 to streamline the process of writing Python extension modules. Pyrex allowed developers to combine Python-like syntax with C data types and operations, compiling the result into efficient C code that interfaces directly with Python's C API. This approach addressed the inherent complexities and boilerplate code required when manually crafting C extensions, particularly for performance-critical applications such as numerical computations where direct API manipulation proved cumbersome and error-prone.^[12]^[2] The motivation behind Pyrex stemmed from the growing demand in scientific computing for tools that could accelerate Python's execution without abandoning its ease of use. In domains like mathematics and data analysis, pure Python's interpreted overhead often resulted in unacceptably slow performance for compute-intensive tasks, such as simulations or large-scale array operations. Projects in this space, including early efforts in computer algebra systems, highlighted the need for a bridge between Python's high-level abstractions and C's low-level efficiency, making Pyrex a valuable asset for integrating legacy C libraries and optimizing bottlenecks.^[13]^[14] By 2007, limitations in Pyrex—such as restricted support for C++ features and lack of compatibility with the emerging Python 3—prompted a fork by developers associated with the SageMath project, leading to the birth of Cython. This divergence was driven by the necessity to support more advanced language integrations and broader Python ecosystem evolution, while preserving Pyrex's core philosophy of static compilation for speed gains in scientific workflows. The fork enabled ongoing enhancements tailored to the demands of high-performance computing, setting Cython apart as a more versatile superset of Python.^[15]^[16]

Key Milestones and Contributors

Cython originated as a fork of the Pyrex project in 2007, initiated primarily to support the development needs of the SageMath mathematical software system, with early enhancements merging community patches for better Python compatibility and performance.^[2] The first stable release, version 0.11, arrived in March 2009, marking a significant step forward with improved error handling during compilation and initial support for C++ class integration, which facilitated more robust extension module development. This release solidified Cython's divergence from Pyrex by emphasizing Pythonic syntax while enabling direct C-level optimizations. Subsequent milestones advanced Cython's alignment with evolving Python ecosystems. Version 0.14, released in 2011, introduced full support for Python 3 syntax and semantics, allowing seamless compilation across Python 2 and 3 runtimes and easing the transition for users adopting the newer Python version.^[17] A major overhaul came with version 3.0 in July 2023, which dropped compatibility with Python 2 entirely and refocused on modern CPython implementations starting from Python 3.8, incorporating updates like exception propagation and vectorcall protocol support to enhance runtime efficiency.^[17] Building on this, version 3.1, released in May 2025, added experimental support for free-threading in CPython 3.13 and the Limited C API, enabling better compatibility with alternative Python implementations and reducing ABI dependencies.^[1]^[17] Version 3.2.0, released on November 5, 2025, further improved support for the Limited API and free-threading in CPython, along with other optimizations.^[17] As of November 2025, the latest stable release is 3.2.1, issued on November 12, 2025, which includes bug fixes building on the enhancements in 3.2.0 for Limited API and free-threading support.^[18] Key contributors have driven Cython's evolution since its inception. Stefan Behnel has served as the lead maintainer since 2007, overseeing core language design and optimization efforts.^[2] Robert Bradshaw played a pivotal role in early integration with SageMath, while Dag Sverre Seljebotn enhanced NumPy array handling through Google Summer of Code funding.^[1]^[2] Other notable figures include Lisandro Dalcín for Python 3 advancements and Vitja Makarov for expanded Python feature coverage.^[2] Funding from organizations such as Google, Enthought, and Tidelift has supported these developments, including targeted improvements for scientific computing libraries.^[1] The project boasts a vibrant community, with over 496 contributors on GitHub as of late 2025, reflecting broad collaboration. Cython was integrated into SageMath by 2008, becoming a foundational tool for its high-performance mathematical computations.^[2]

Core Features

Syntax and Semantic Extensions

Cython extends the Python programming language, supporting both traditional .pyx files as a superset and pure Python mode for .py files. It parses source using Python's grammar for dynamic features—such as duck typing and runtime polymorphism—alongside static C-like elements in .pyx mode or type annotations in pure mode for enhanced performance and low-level control.^[19]^[20] This design allows seamless integration of Python code with C constructs, where standard Python syntax remains valid; in .pyx files, new keywords enable C-level operations, while in pure mode, PEP 484/526 type hints provide static typing without syntax extensions.^[21] Key syntactic extensions in .pyx mode include the cdef keyword, which declares C-level variables, functions, and structs directly in the source code, bypassing Python's dynamic dispatch for faster execution and direct memory access.^[22] For functions that need to be accessible from both Python and C contexts, the cpdef keyword provides a hybrid declaration, allowing calls from Python space while using C calling conventions internally for efficiency.^[23] Additionally, the nogil directive can be applied to functions or code blocks to indicate that they do not require the Python Global Interpreter Lock (GIL), enabling safe execution in multithreaded environments without explicit GIL management, including compatibility with free-threaded Python as of version 3.1.^[24]^[1] The cimport directive further supports modularity by importing declarations from .pxd files or C header files as if they were Python modules, facilitating the reuse of C definitions across Cython modules.^[25] Semantically, Cython maintains Python's automatic memory management through reference counting for Python objects passed as parameters or returned from functions, ensuring compatibility with Python's garbage collection model.^[26] However, it permits optional manual memory handling using C standard library functions like malloc and free for raw C types, allowing developers to opt into lower-level control when needed.^[27] For C++ integration, Cython introduces cdef cppclass within cdef extern from blocks to declare and wrap C++ classes, exposing their methods and attributes to Cython code while handling constructors, namespaces, and exceptions appropriately.^[28] Compiler directives such as language_level enforce specific Python version semantics—such as treating strings as Unicode in Python 3 mode—affecting type interpretation and compatibility without altering the core syntax.^[11] Extension types, declared using cdef class, represent a semantic extension to Python's class system by implementing them as C structs with vtables, which provides contiguous memory layout for attributes and accelerates access compared to Python's dictionary-based storage.^[29] These types support single inheritance and allow Python subclasses to override methods, blending object-oriented Python semantics with C's structural efficiency.^[29]

Type Declarations and Optimizations

In .pyx files, Cython employs the cdef keyword to declare variables, functions, and types with static semantics, enabling direct mapping to C constructs and bypassing Python's dynamic type system; additionally, in pure Python mode, static typing can be achieved using standard Python type annotations from the typing module. This includes support for fundamental C types such as int, float, double, char, and void*, as well as pointers like int* and arrays such as int[10]. For Python interoperability, declarations can reference objects using object or PyObject*, allowing seamless handling of Python data structures within statically typed contexts.^[30]^[20] Memoryviews provide an efficient mechanism for accessing array-like data structures, declared using syntax like double[:] for one-dimensional views or int[:,:] for multi-dimensional ones, integrating with the Python buffer protocol (PEP 3118) to support NumPy arrays and other buffers without copying data. This facilitates zero-overhead access to contiguous memory regions, with optional bounds checking that can be disabled via compiler directives like boundscheck=False to further enhance performance in inner loops.^[31] Static type declarations optimize code by eliminating runtime type checks, object boxing, and dynamic dispatch, allowing the compiler to generate direct C calls and inline operations, which can result in significant speedups—often by orders of magnitude in compute-intensive sections. For instance, typing loop variables and accumulators as C types transforms Python loops into equivalent C loops, reducing overhead from Python's interpreter. Fused types, declared with cython.fused_type(), extend this by enabling generic functions over sets of compatible types (e.g., numeric types like int and double), where the compiler specializes implementations at compile time for optimal performance without runtime type resolution. In practice, such optimizations have demonstrated up to 150x speedups for simple typed functions compared to pure Python equivalents.^[32]^[33]

Building and Deployment

Compilation Process

The compilation of Cython code occurs in two primary stages: first, a source file with a .pyx extension (or .py for pure Python code) is processed by the Cython compiler to generate an equivalent C file, and second, that C file is compiled by a standard C compiler into a platform-specific shared library, such as .so on Linux and macOS or .pyd on Windows.^[9] This workflow enables Cython modules to function as efficient Python extension modules, leveraging the Python C API for interoperability.^[9] To initiate compilation from the command line, developers typically use the cythonize script, which is included with Cython installations and automates both the translation to C and the subsequent compilation to a shared library when provided with appropriate compiler flags.^[9] For more complex projects, integration with Python's packaging tools is recommended; a setup.py file can employ setuptools or distutils by importing cythonize from Cython.Build and specifying extension modules, as in ext_modules = cythonize("example.pyx"), followed by invoking setup() with the build_ext command.^[9] This approach handles dependencies and ensures the generated C code includes the necessary #include <Python.h> directive to embed access to the Python interpreter and its C API.^[9] Compiler directives allow fine-tuned control over the compilation process, influencing optimizations applied during the Cython-to-C translation stage.^[9] These directives, such as boundscheck=False to disable array bounds checking or wraparound=False to skip negative indexing adjustments, can be specified in special comments at the top of .pyx or .pxd files (e.g., #cython: boundscheck=False), passed via the command line with cython -X boundscheck=False, or set globally in setup.py through the cythonize function's compiler_directives parameter.^[9] Type declarations, as discussed in the Core Features section, further guide the compiler in generating optimized C code by enabling static typing where applicable.^[9] Cython's build process supports cross-platform development across Windows, Linux, and macOS, provided that a compatible C compiler is available—such as GCC or Clang on Unix-like systems and MSVC on Windows—and the Python development headers are installed to provide access to the Python C API.^[9] These headers, typically included in Python's "dev" or "devel" packages (e.g., python3-dev on Debian-based Linux distributions), are essential for linking against the Python interpreter during the final C compilation step.^[9] Starting with version 3.0, Cython incorporates a new frontend parser, which enhances compatibility with modern Python features and contributes to improved compilation performance in subsequent releases.^[17] As of November 2025, the latest stable release is Cython 3.2.0, which includes further optimizations to the build process.^[18]

Generated Code and Runtime Behavior

The Cython compiler translates source code into C code that forms a standard Python extension module, incorporating calls to the Python C API to interface with the Python runtime. For functions declared with def, which are visible to Python code, the generated C includes boilerplate to parse arguments using functions like PyArg_ParseTuple and convert them to appropriate types, ensuring compatibility with Python's dynamic calling conventions. In contrast, sections with static type declarations, such as cdef functions or typed variables, generate direct C operations without Python object overhead, bypassing API calls for those parts to enable efficient computation. The resulting module exports an initialization function, typically named PyInit_<module_name>, which Python's import mechanism invokes to register the module's contents, such as functions and classes, into the Python namespace.^[34]^[9] At runtime, compiled Cython extensions load seamlessly as Python modules via the standard import statement, integrating with the Python interpreter as if they were pure Python code. Memory management in these modules relies on Python's reference counting for objects interacting with Python code, while C-level allocations in typed sections use manual calls to functions like PyMem_Malloc and PyMem_Free or standard C malloc/free to avoid garbage collection overhead where possible. Cython supports releasing the Global Interpreter Lock (GIL) using the nogil context manager or function qualifier, allowing parallel execution in multithreaded scenarios, such as within OpenMP directives or prange loops, provided the code avoids Python API calls that require the GIL. This enables true concurrency in CPU-bound tasks but requires careful design to prevent race conditions on shared Python objects.^[35]^[36]^[24] Performance benefits arise from eliminating Python's interpreter overhead in compiled sections, where typed loops and operations execute as native C code, often yielding speedups of 10x to 100x over equivalent pure Python for numerical tasks. However, the runtime retains Python's garbage collector for any Python objects, which can introduce pauses in long-running computations, and untyped code still incurs API call costs. Cython-annotated pure Python code, compilable directly from .py files, provides partial speedups by optimizing only the typed portions while preserving full Python compatibility. In free-threading mode, available in Python 3.13 and later (officially supported starting with Python 3.14) without the GIL, Cython extensions can achieve true thread parallelism, enhancing scalability on multicore systems.^[37] Additionally, Cython's support for the Python Limited API ensures binary compatibility across minor Python versions by restricting use of stable C API subsets, reducing ABI breakage risks during upgrades.^[34]^[38]^[39]

Practical Usage

Writing and Executing Code

Cython code is typically written in source files with the .pyx extension, which can contain a mix of Python code and Cython-specific extensions for performance optimization. For declaring pure-mode C interfaces or external C library headers without implementation details, separate declaration files with the .pxd extension are used, allowing modular separation of interfaces from implementations. The basic structure of a .pyx file includes standard Python imports for modules, def statements for functions that are directly callable from Python with full Python object semantics, and cdef statements for internal C-level functions that offer faster execution but require wrappers to expose them to Python code.^[9]^[30]^[30] To illustrate, consider a simple addition function in a file named add.pyx:

cython
cdef int add(int a, int b):
    return a + b

def py_add(a, b):
    return add(a, b)
cdef int add(int a, int b):
    return a + b

def py_add(a, b):
    return add(a, b)

Here, add is a cdef function optimized for C-level speed, while py_add is a def wrapper that makes it accessible from Python. This setup allows type declarations like int to enable static typing, reducing Python overhead.^[30]^[32] Once written, Cython modules are compiled to extension modules (.so on Unix-like systems or .pyd on Windows) before execution. Compilation can be performed directly using the command python -m cythonize -i add.pyx, which translates the .pyx file to C and builds the extension in place. Alternatively, a setup.py file can be used with from setuptools import setup; from Cython.Build import cythonize; setup(ext_modules=cythonize("add.pyx")), followed by python setup.py build_ext --inplace for building. The resulting module is then imported and executed like any Python module, for example, in a script test.py: import add; print(add.py_add(3, 4)). Cython supports incremental compilation during development, recompiling only modified files unless forced otherwise, and reports errors via Python-like tracebacks that include C line numbers when enabled.^[9]^[9]^[9] To demonstrate performance benefits, compare the Cython version against pure Python. A pure Python equivalent might be def py_add(a, b): return a + b. When calling py_add(1, 2) a million times in a loop, the pure Python version takes approximately 50-100 milliseconds, while the typed Cython version completes in under 1 millisecond, yielding a speedup of over 50 times due to eliminated type checking and object overhead. This gain is more pronounced in loops or numerical computations, though simple single calls show modest improvements.^[32]^[32]

Integration with Tools and Ecosystems

Cython provides robust integration with interactive computing environments, particularly Jupyter and IPython, facilitating seamless development and execution of Cython code within notebooks. To enable this, users install Cython via pip or conda and load the extension using the %load_ext Cython command in a notebook cell. The %%cython magic then allows direct compilation and import of Cython code blocks, supporting features like type declarations (e.g., cdef int i) and inline annotations for performance analysis. For instance, a simple loop can be compiled on-the-fly, demonstrating speedups without leaving the notebook interface. This workflow supports rapid iteration, with options like %%cython --annotate to generate HTML visualizations of the compiled C code for optimization insights.^[35] In scientific computing ecosystems, Cython excels in compatibility with libraries such as NumPy and SciPy, enabling efficient array operations and extension building. Integration with NumPy occurs through cimport numpy, which grants access to the NumPy C API for declaring typed memoryviews (e.g., double[:, :]) that provide zero-overhead access to array data, bypassing Python object overhead. This is particularly useful for numerical algorithms, where memoryviews can yield performance comparable to native C code. For SciPy, Cython files (.pyx) are incorporated into the build system via Meson, allowing developers to write performance-critical components like special functions or optimizers in Cython while leveraging SciPy's Python interface. Cython's support for fused types further enhances generality across NumPy dtypes, such as integers and doubles.^[40]^[41] Cython modules are packaged and distributed using standard Python tools, ensuring broad accessibility. The setuptools library, combined with Cython.Build.cythonize, automates compilation of .pyx files into extension modules during installation via pip install or building wheels for binary distribution. This process supports platform-specific optimizations and dependency management, with wheels enabling quick deployment without requiring users to have Cython or compilers installed. For environments like Conda, Cython is available through the conda-forge channel, allowing binary installations and packaging of Cython-based projects into conda recipes for reproducible scientific workflows. Continuous integration tools, such as GitHub Actions, streamline builds by automating wheel creation and testing across platforms using actions like cibuildwheel.^[9]^[42]^[43]^[44] Integrated development environments (IDEs) enhance Cython development through syntax highlighting, code completion, and error detection for .pyx files. PyCharm offers native support, recognizing Cython syntax and providing features like navigation to definitions and integration with its debugger for stepping through compiled code. In Visual Studio Code, the vscode-cython extension delivers syntax highlighting and static checking, making it suitable for lighter-weight editing. These tools facilitate collaborative development while maintaining compatibility with Python workflows.^[45]^[46] Debugging compiled Cython modules is supported through tools like GDB via the cygdb wrapper, which enables setting breakpoints in Cython source lines (e.g., cy break function:line), stepping through code, and inspecting variables with Python-aware commands. Compilation with debug flags (e.g., cython --gdb) embeds symbols for this purpose, allowing integration with Python's pdb for hybrid debugging sessions where Python calls trigger Cython execution. This setup is essential for troubleshooting performance issues in extension modules.^[47] Cython 3.x introduces enhancements tailored for NumPy interoperability, including the @cython.ufunc decorator for rapidly generating NumPy universal functions from scalar Cython functions, supporting vectorized operations across arrays without manual loop unrolling. Typed memoryviews receive further optimizations for array manipulations, such as contiguous slicing and fused type compatibility, improving efficiency in numerical tasks. These features, combined with Conda's binary distribution capabilities, make Cython 3.x a cornerstone for modern scientific Python ecosystems.^[17]^[31]

Advanced Topics

Interfacing with C/C++ Libraries

Cython facilitates the integration of existing C libraries by allowing developers to declare and call external C functions and variables directly within Cython code, enabling seamless wrapping without the overhead of Python's C API in many cases.^[48] This is primarily achieved through the cdef extern from directive, which declares C entities and instructs the compiler to include the corresponding header file in the generated C code.^[48] For instance, to interface with the standard C math library, one can declare the sin function as follows:

cython
cdef extern from "math.h":
    double sin(double x)
cdef extern from "math.h":
    double sin(double x)

This declaration allows direct calls to sin from Cython functions, with the generated code invoking the native C implementation.^[48] To expose such wrapped functions to Python, they can be defined in a def or cpdef function, ensuring compatibility with Python's dynamic typing while maintaining C-level performance.^[49] For more complex libraries, Cython supports the creation of .pxd files to mirror C header declarations, promoting modularity and reuse across modules.^[48] A .pxd file contains cdef declarations without implementation, which can then be imported via cimport in .pyx files. This approach avoids naming conflicts and facilitates the distribution of interface definitions separately from implementations. For example, wrapping a simple C queue library might involve a queue.pxd file declaring the struct and functions like queue_new and queue_push_head, followed by a .pyx file that imports these and wraps them in a Python class using @cython.cclass for memory management via __cinit__ and __dealloc__.^[49] Error handling in such wrappers typically involves checking for null pointers and raising Python exceptions, such as MemoryError or IndexError, to propagate C-level errors idiomatically.^[49] Cython extends these capabilities to C++ libraries, providing native support for C++ features like classes, templates, and the Standard Template Library (STL).^[50] Declarations use cdef extern from with C++ headers, and namespaces are specified directly, such as cdef extern from "lib.hpp" namespace "std".^[50] STL containers are accessible through dedicated libcpp modules, for example, from libcpp.map cimport map as cpp_map, allowing coercion between Python sequences and C++ types like vector[T] or map[Key, Value].^[50] C++ classes can be wrapped using cdef cppclass in .pxd files, enabling the creation of Python extensions that instantiate and manipulate C++ objects, with dynamic allocation via new and deallocation via del.^[50] A key aspect of C/C++ interfacing in Cython is handling callbacks from C code back to Python, which is managed through PyObject* pointers and the Global Interpreter Lock (GIL).^[48] Callbacks declared as cdef void callback(PyObject* data) require explicit GIL acquisition using with gil: to safely interact with Python objects, preventing thread-safety issues.^[48] For templated C++ code, Cython supports parameterization via bracket notation, such as vector[int], and integrates with fused types—compile-time generics in Cython—to generate specialized code that maps to C++ templates, enhancing type safety and performance.^[50] These mechanisms allow Cython to serve as a robust bridge for leveraging optimized C/C++ libraries in Python ecosystems.^[48]

Support for Alternative Runtimes and Features

Cython offers experimental support for alternative Python runtimes beyond the standard CPython interpreter, enabling developers to compile code for environments like PyPy while adapting to their unique behaviors. This compatibility is achieved through adaptations in the generated C code, such as using PyPy's cpyext layer, which emulates parts of the CPython C API but introduces differences in reference counting, object lifetimes, and borrowed references. However, the benefits of Cython's static typing are limited in PyPy due to its just-in-time (JIT) compiler and garbage collection mechanisms, which can reduce the performance gains from type annotations compared to CPython.^[51] For PyPy specifically, Cython compiles modules to C code that is compatible via cpyext, allowing much of the codebase to run unchanged after minor adjustments for PyPy-specific issues, such as non-reentrant GIL handling or performance quirks in functions like PyTuple_GET_ITEM. The support is considered usable with the latest PyPy versions and is tested in Cython's continuous integration, marking it as officially supported in recent releases. A notable improvement came in Cython 3.1.4, which resolved crashes during tracing of C function returns, enhancing reliability for debugging and profiling in PyPy environments.^[51]^[1]^[17] Cython 3.1 and later versions provide experimental compatibility with Python's free-threaded builds, introduced in CPython 3.13, where the global interpreter lock (GIL) is disabled to enable true parallelism. In these builds, Cython's nogil blocks—sections of code declared to run without acquiring the GIL—function without re-enabling the lock, allowing for multi-threaded execution in performance-critical sections. To ensure compatibility, modules can be marked with the # cython: freethreading_compatible = True directive, and testing is recommended using flags like PYTHON_GIL=0. This support leverages Python's critical section API and synchronization primitives for thread safety, though manual locking is often required to avoid race conditions.^[39]^[1] Additionally, Cython supports the Python Limited API and stable ABI starting from version 3.1, allowing extension modules to be built against a fixed subset of the C API for binary compatibility across Python versions from 3.8 onward. By setting the Py_LIMITED_API macro during compilation (e.g., to 0x03080000), modules can use the .abi3.so naming convention and avoid recompilation when deploying to different Python minor versions, though testing across targets is essential due to potential runtime differences. This feature restricts certain advanced usages, such as inheriting from builtin types in cdef classes or full profiling support, but it minimizes overhead for C-level operations while enabling broader distribution.^[38] Cython provides partial support for asynchronous programming features like async/await, primarily in Python-mode functions where coroutines can be defined and used with the asyncio library. Async generators, for instance, require CPython 3.6 or later for proper finalization during cleanup. However, cdef functions have limited coroutine capabilities, as they cannot directly use await expressions; instead, they can construct and return awaitable objects for integration with event loops. This enables efficient async code in Cython but falls short of full C-level coroutine support.^[52]^[53] Compatibility with other Python implementations like Jython and IronPython is partial at best and not recommended for production use, as Cython relies on the CPython C API, which these JVM- and .NET-based runtimes do not fully emulate. While pure Python subsets of Cython code may execute, the generated C extensions fail due to incompatible internals, limiting portability to experimental or legacy scenarios.^[54]

Applications and Impact

Scientific and Numerical Computing

Cython plays a pivotal role in scientific and numerical computing by enabling the acceleration of performance-intensive operations, such as loops over large datasets and array manipulations in simulations, through its compilation to optimized C code. This allows researchers to retain Python's expressiveness while achieving near-native speeds for tasks like finite element simulations or Monte Carlo methods, where pure Python would bottleneck due to interpreter overhead. For example, in numerical simulations involving iterative array processing, Cython can deliver speedups of up to 800 times compared to unoptimized Python code for in-cache matrix multiplications.^[55] A key strength of Cython in this domain lies in its seamless integration with NumPy and SciPy, facilitating the creation of typed universal functions (ufuncs) and efficient distance computations on multidimensional arrays. Typed memoryviews provide zero-copy access to NumPy arrays, eliminating data copying overhead and enabling direct manipulation at the C level, which is essential for memory-efficient processing in large-scale scientific workflows. This integration supports operations like vectorized distance metrics in clustering algorithms, where Cython code can outperform pure NumPy implementations by factors of 9 to 11 times, particularly when combined with contiguous array layouts.^[40] Cython is integral to major projects in scientific computing, including SageMath, where it has been used since 2008 to compile performance-critical mathematical routines for symbolic and numerical computations. In scikit-learn, Cython implements core machine learning algorithms, such as those involving matrix operations for classification and regression. Similarly, spaCy leverages Cython for its NLP pipelines, optimizing numerical aspects like token vector operations and batch processing in language models. These applications highlight Cython's ability to handle 10-1000x speedups in matrix operations, as demonstrated in benchmarks for linear algebra routines.^[56]^[57]^[58]^[55] Advanced techniques in Cython further enhance its utility for numerical computing, including memoryviews for efficient, zero-copy NumPy array access, which minimizes memory bandwidth usage in simulation loops. Parallelization is achieved through OpenMP directives, such as #pragma omp parallel for inserted via Cython's prange construct in the generated C code, allowing multi-threaded execution of independent array operations without the global interpreter lock. Cython is also employed in NumPy's build system via Cython-based distutils extensions for compiling random number generators and other low-level modules. In recent machine learning applications, Cython facilitates GPU acceleration by interfacing with C++ code and libraries like CUDA via its C++ support, enabling faster computations on multi-dimensional arrays.^[40]^[59]^[60]^[61]

Broader Software Development Uses

Cython extends beyond numerical domains into systems programming, where it facilitates the creation of high-performance modules for asynchronous I/O and data processing. For instance, the gevent library, which provides asynchronous networking capabilities through greenlets, compiles key components such as its event handling and queue modules with Cython to achieve enhanced performance over pure Python implementations.^[62] Similarly, lxml, a widely used XML and HTML processing library, is implemented in Cython to wrap the libxml2 and libxslt C libraries, enabling efficient parsing and serialization while maintaining a Pythonic interface.^[63] These applications leverage Cython's ability to generate optimized C code from Python-like syntax, allowing developers to handle I/O-intensive tasks without resorting to low-level C programming. In web development and machine learning pipelines, Cython supports extensions for natural language processing and feature preparation. The spaCy library, an industrial-strength NLP toolkit, is primarily written in Cython to optimize tokenization and linguistic feature extraction, processing text at speeds suitable for production environments.^[64] For machine learning, scikit-learn employs Cython extensively in its core algorithms and utilities, including feature engineering transformers that accelerate data preprocessing steps like cyclical feature extraction for time-series data. Additionally, Cython enables embedding Python interpreters into C applications, allowing hybrid systems where Python scripts handle dynamic logic within performance-critical C backends, such as in server-side tools or custom runtimes.^[10] Overall, Cython's role in broader software development diminishes the reliance on pure C extensions within Python ecosystems by bridging the performance gap, often yielding code that runs at near-C speeds while retaining Python's readability.^[1] This facilitates the development of hybrid applications, including game engines that integrate Python scripting for rapid prototyping and asset management alongside C-based rendering cores. By supporting such integrations, Cython empowers diverse stacks, from web services to embedded systems, to balance developer productivity with runtime efficiency. As of November 2025, Cython's version 3.2 continues to support modern Python features and ecosystems, including ongoing contributions to AI and scientific computing libraries.^[65]

References

[1]
Cython: C-Extensions for Python
### Summary of Cython
[2]
Cython is 20! | Stefans Welt
Apr 4, 2022 · Today, Cython celebrates its 20th anniversary! On April 4th, 2002, Greg Ewing published the first release of Pyrex 0.1.Missing: initial | Show results with:initial
[3]
an overview — Cython 3.2.0 documentation
[Cython] is a programming language that makes writing C extensions for the Python language as easy as Python itself. It aims to become a superset of the [Python] ...
[4]
Faster code via static typing - Cython's Documentation
Cython is a Python compiler. This means that it can compile normal Python code without changes (with a few obvious exceptions of some as-yet unsupported ...
[5]
Cython Changelog — Cython 3.2.0b3 documentation
Cython 3.0.0 has been a very large effort that cleaned up many old warts, introduced many new features, and introduces a couple of intentional behaviour ...
[6]
None
Nothing is retrieved...<|control11|><|separator|>
[7]
FAQ — Cython 3.2.0b3 documentation
What is the relation between Cython and Pyrex?¶. Answer: Cython started originally based on a previous project called Pyrex, developed primarily by Greg Ewing.<|separator|>
[8]
cython/cython: The most widely used Python to C compiler - GitHub
Cython is an optimising Python compiler that makes writing C extensions for Python as easy as Python itself. Cython translates Python code to C/C++ code, ...
[9]
Source Files and Compilation — Cython 3.2.0 documentation
### Summary of Cython as a Superset of Python, Compatibility, .pyx Files, and Directives
[10]
Embedding Cython modules in C/C++ applications
Cython doesn't compile to “pure stand-alone C code”. Instead Cython compiles to a bunch of Python C API calls that depend on the Python interpreter. Therefore, ...
[11]
https://cython.readthedocs.io/en/latest/src/userguide/source_files_and_compilation.html#compiler-directives
[12]
Pyrex
### Summary of Pyrex Origins, Motivation, and Purpose
[13]
[PDF] Cython
Cython's Origins. Greg Ewing is the author of Pyrex, Cython's predecessor. When Pyrex was first released, its ability to speed up Python code by large ...
[14]
Interview with Stefan Behnel Lift your Speed Limits with Cython. Fast ...
Its origins date back to the early 2000s when Greg Ewing came up with the idea of designing a language (called Pyrex) that mixes Python and C to develop native ...
[15]
Cython, Sage, and the Need for Speed
Nov 3, 2010 · History. Greg Ewing wrote "Pyrex" in 2002--2004..., which I guess he ... Pyrex let you write basically Python-ish code that gets ...
[16]
Differences between Cython and Pyrex
Both Cython and Pyrex are moving targets. It has come to the point that an explicit list of all the differences between the two projects would be laborious to ...Missing: origins | Show results with:origins
[17]
Cython Changelog — Cython 3.2.0b3 documentation
Cython 3.0.0 has been a very large effort that cleaned up many old warts, introduced many new features, and introduces a couple of intentional behaviour ...Missing: milestones | Show results with:milestones
[18]
https://github.com/cython/cython/releases
[19]
https://cython.readthedocs.io/en/latest/src/userguide/language_basics.html#the-implementation-file
[20]
https://cython.readthedocs.io/en/latest/tutorial/pure.html
[21]
https://cython.readthedocs.io/en/latest/src/userguide/language_basics.html#declaring-data-types
[22]
Cython and the GIL
A small number of “Python operations” may be performed in a nogil block without needing to explicitly use with gil . The main example is throwing exceptions.Missing: directive | Show results with:directive
[23]
https://cython.readthedocs.io/en/latest/src/userguide/language_basics.html#python-functions-vs-c-functions
[24]
https://cython.readthedocs.io/en/latest/src/userguide/nogil.html
[25]
https://cython.readthedocs.io/en/latest/src/userguide/source_files_and_compilation.html#cython-include-files
[26]
https://cython.readthedocs.io/en/latest/src/userguide/language_basics.html#python-objects-as-parameters-and-return-values
[27]
Extension types (aka. cdef classes) - Cython's Documentation
Cython supports a second kind of class: extension types, sometimes referred to as “cdef classes” due to the Cython language keywords used for their declaration.
[28]
Language Basics — Cython 3.3.0a0 documentation
### Summary of Cython Syntax Extensions and Features
[29]
Typed Memoryviews — Cython 3.3.0a0 documentation
### Summary of Memoryviews in Cython
[30]
Faster code via static typing - Cython's Documentation
They will allow Cython to step out of the dynamic nature of the Python code and generate simpler and faster C code - sometimes faster by orders of magnitude.
[31]
Fused Types (Templates) — Cython 3.3.0a0 documentation
### Summary of Fused Types in Cython
[32]
Basic Tutorial — Cython 3.2.0 documentation
The first line is a compiler directive. It tells Cython to compile your code to C++. This will enable the use of C++ language features and the C++ standard ...
[33]
Building Cython code
There are several ways to build Cython code. Currently, using setuptools is the most common way Cython files are built and distributed.<|control11|><|separator|>
[34]
Memory Allocation — Cython 3.2.0b3 documentation
Dynamic memory allocation is mostly a non-issue in Python. Everything is an object, and the reference counting system and garbage collector automatically ...
[35]
The Limited API and Stable ABI - Cython's Documentation
The Limited API and Stable ABI are two related features of Python. Extension modules that only use a safe subset of the Python C API (the Limited API) get a ...
[36]
Free threading — Cython 3.2.0 documentation
It is intended for sharing locks between modules with the Limited API (since PyMutex is unavailable in the Limited API). Note that unlike the “raw ...Missing: 2024 | Show results with:2024
[37]
Cython for NumPy users
The SAGE mathematics software system provides excellent support for using Cython and NumPy from an interactive command line or through a notebook interface ( ...
[38]
Adding Cython to SciPy — SciPy v1.16.2 Manual
To add Cython to SciPy, use .pyx files and add them to the meson.build configuration of the subpackage. Cython compiles these to C code.
[39]
Installing Cython — Cython 3.2.0 documentation
Unlike most Python software, Cython requires a C compiler to be present on the system. The details of getting a C compiler varies according to the system used:.Missing: loading | Show results with:loading<|control11|><|separator|>
[40]
Cython - Anaconda.org
Cython is an optimising static compiler for both the Python programming language and the extended Cython programming language.
[41]
pypa/cibuildwheel: Build Python wheels for all the platforms ... - GitHub
Builds manylinux, musllinux, macOS, and Windows wheels for CPython, PyPy, and GraalPy · Works on GitHub Actions, Azure Pipelines, Travis CI, CircleCI, GitLab CI, ...Issues 86 · Pull requests 8 · Discussions · Security
[42]
Cython support | PyCharm Documentation - JetBrains
Jul 8, 2025 · PyCharm provides initial Cython support out of the box. PyCharm recognizes .pyx, .pxd, and .pxi files, and allows you to edit them.
[43]
Cython VSCode - Visual Studio Marketplace
Mar 21, 2024 · This extension provides syntax highlighting and a Static Syntax checker for the Cython language. Cython is a superset of Python with it's own C-styled syntax.Missing: IDE PyCharm
[44]
Debugging your Cython program
The Cython debugger comes with a set of commands that support breakpoints, stack inspection, source code listing, stepping, stepping over, etc.Missing: pdb | Show results with:pdb
[45]
Interfacing with External C Code — Cython 3.3.0a0 documentation
### Summary: Cython Code Interaction with Python Modules, Importing, and Embedding Interpreter
[46]
Using C libraries — Cython 3.2.0 documentation
The following gives a complete example for using (and wrapping) an external C library in Cython code, including appropriate error handling and considerations.
[47]
Using C++ in Cython — Cython 3.3.0a0 documentation
### Summary: Support for C++ Classes with `cdef cppclass` in Cython
[48]
Porting Cython code to PyPy
This page lists major differences and ways to deal with them in order to write Cython code that works in both CPython and PyPy.Object Lifetime · Borrowed References And Data... · Efficiency<|control11|><|separator|>
[49]
Cython Changelog — Cython 0.29.37 documentation
C++ object arguments for the constructor of Cython implemented C++ are now passed by reference and not by value to allow for non-copyable arguments, such as ...
[50]
How to await in cdef? - Stack Overflow
Feb 26, 2018 · Cython supports coroutines via yield from and await only for Python functions. Even for a classical coroutine, a cdef is not possible: Error ...Using Cython with Asyncio (Python 3.4) - Stack OverflowCython used with async - Decorators can only be followed by ...More results from stackoverflow.com
[51]
an overview — Cython 3.1.6 documentation
Other major implementations use Java (Jython [Jython]), C# (IronPython [IronPython]) and Python itself (PyPy [PyPy]).
[52]
[PDF] Fast numerical computations with Cython - SciPy Proceedings
We discuss Cython's features for fast NumPy array access in detail through examples and benchmarks. Using Cython to call natively compiled scientific li-.Missing: integration | Show results with:integration
[53]
Coding in Cython - Developer Guide - SageMath Documentation
Cython is a compiled version of Python. It was originally based on Pyrex but has changed based on what Sage's developers needed; Cython has been developed ...
[54]
Cython Best Practices, Conventions and Knowledge - Scikit-learn
This document contains tips to develop Cython code in scikit-learn. Tips for developing with Cython in scikit-learn Tips to ease development Activating checks ...
[55]
spaCy · Industrial-strength Natural Language Processing in Python
spaCy is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.Install spaCy · spaCy 101 · Projects · Training Pipelines & Models
[56]
Writing parallel code with Cython
One method of speeding up your Cython code is parallelization: you write code that can be run on multiple cores of your CPU simultaneously.Missing: numerical computing
[57]
Extending numpy.random via Cython
Starting with NumPy 1.26.0, Meson is the default build system for NumPy. See Status of numpy.distutils and migration advice. meson.build ...
[58]
Accelerating Python on GPUs with nvc++ and - NVIDIA Developer
Nov 10, 2020 · In this post, I explore a way to bring GPU-accelerated C++ algorithms to the Python ecosystem. I use Cython as a way to call C++ from Python and show you how ...Missing: ML tensor operations interop 3.1
[59]
Changes for 1.3 - gevent 25.9.2.dev0 documentation
Oct 12, 2018 · Event and gevent.event.AsyncResult are compiled with Cython for improved performance, as is the gevent.queue module and gevent.hub ...
[60]
How to read the source of lxml
Cython is the language that lxml is written in. It is a very Python-like language that was specifically designed for writing Python extension modules. The ...
[61]
Linguistic Features · spaCy Usage Documentation
After tokenization, spaCy can parse and tag a given Doc . This is where the trained pipeline and its statistical models come in, which enable spaCy to make ...Trained Models & Pipelines · Visualizers · Token