[GR-48103] Native Image Layers

### TL;DR

Allow users to create native images that depend on a base image, or a chain of base images. Such *application images* are much faster to build compared with standalone images, providing an improved development experience. Moreover, base images can be shared not just across applications but potentially also across different operating system processes, which can reduce the overall memory footprint.

### Motivation

Up until today, users can only build standalone native images, which are single executables (or shared libraries) containing all machine code and an object heap for a given application. While standalone images can benefit from closed-world optimizations, the build step always needs to analyze and compile the full application, including its dependencies, the JDK, and the Substrate VM runtime system.

Layered native images will work similarly to image layers in the container world: Certain components of an application, for example a microservices framework and dependencies, can be turned into one or more base images. On top of such base images, application images can be built. Such images are much smaller and faster to build compared with standalone images because only the application code needs to be recompiled. Everything that is part of the base images can be re-used and does not need recompilation.

Although base images cannot benefit from closed-world optimizations anymore and will thus be larger in size, they provide another benefit: Since multiple applications can use the same base images, base images can be shared, at least partly, across operating system processes, which can reduce the overall memory footprint when deploying multiple application images on the same machine.

### Goals

* Reduce image build time for the application (not for the layered images themselves, or the total time necessary to build all layer dependencies): Allow incremental building of native images where a base image for the core dependencies of the application is pre-built and for application code changes, only the application code needs to be considered.
* Reduce the combined RSS memory size when many different applications that share common base frameworks and libraries run on the same machine.
* Provide a memory efficient and secure integration into [GraalOS](https://www.graal.cloud/graalos/), but also work outside of GraalOS (at first on Linux, eventually also on MacOS and Windows).
* Guarantee the same programming model restrictions as standalone images. We want to be able to "upgrade" a layered application image to a standalone image in GraalOS when that application is frequently used.

### Non-Goals

* Layered images are no general purpose standard shared libraries that can be used from other languages (like from C).
* No interchangeability of images: We do not allow building different base images and mix / match different such images at run time. When building an application, the base images it depends on are fixed at build time. When a base image is rebuilt, all applications that depend on the base image need to be rebuilt too. This includes minor bugfixing releases of a framework or the JDK.
* No overall build time improvements: We accept that the sum of the image build time of the base images and the application image will be higher than the current overall build time.
* No overall image size improvements: The size of the base images plus the application image will be higher (probably 2.5x to 5x) than the current image size.

### Success Metrics

* Reduce the binary size of a Spring/Micronaut/Quarkus application image to at most a few MBytes.
* Reduce the total RSS of all applications when many applications are running because machine code and image heap of base images can be shared between different applications.
* Keep the peak performance acceptable, at most ~20% reduced peak performance (compared to a standalone native image). Some loss of peak performance will be inevitable because closed-world optimizations are no longer possible.

---

### Technical Details

#### Terminology

The current approach of building a single image that contains the application, libraries, the JDK, and the runtime system is called a "standalone image".

In the new approach, an application is split up into multiple "layered images", with a linear chain of dependencies. The layered image that one particular image is based on is called its "base image". Due to the linear chain, a base image can have a base image itself. An image that contains (at least) the java.base module of the JDK and the Substrate VM runtime system cannot depend on a base image. The layered images that depend on one particular image are called "extension images". Extension images can have extension images themselves. A leaf image for an actual application, i.e., an image that can actually be executed, is called an "application image". Slightly different rules (like doing closed-world optimizations) apply to application images because they are no longer extensible.

#### Workflow

Input when building a layered image:

* The classpath of the library/framework (or empty when building a layered image that only contains JDK classes).
* A list of packages/classes that should be available in the layered image. All transitive dependencies from those packages/classes are included in the layered image too.
* A single base image that the layered image depends on (the base image itself can depend on another base image, allowing a linear list of base image dependencies)

Output when building a layered image that is not an application image:

* The actual layered native image, containing the machine code and the image heap. This native image is not a valid shared library according to the normal OS linking and ABI, i.e., it cannot be used from anything else than native images that are based on it.
* A layer archive file that contains additional information necessary to build extension images. In addition to the .jar files (for consistency checking), the layer archive contains information about the image heap and the Graal IR graphs of the classes included in the layered image.

The application image itself that depends on other layered images needs to be a valid application that can be loaded by the OS.

#### Programming model

We want to be able to "upgrade" a layered application image to a standalone image in GraalOS when that application is frequently used. This means that a layered application image must behave exactly the same way as a standalone image, i.e., have the same restrictions regarding reflection and resource usage. We cannot just make everything available for reflection, or include all resources, in base images - if over-registration happens in the base image (to have more sharing for different applications), then we must have a filtering at run time that still adheres to the full restrictions that a standalone image has.

This also applies to features configured via options. For example, we might over-configure a base image to be prepared for all URL protocols or all locales, but options such as `--enable-url-protocols=http,https,ftp,jar` and `-H:+IncludeAllLocales` can only be provided by the final application build, so the over-configured protocols and locales from the base image must not be visible to the application.

Some major changes to the programming model of Native Image are necessary: we must no longer rely on reachability information during the image build. All places where reachability information is exposed to the user must be changed:

Currently, conditional configuration based on the reachability of classes is supported in reflection and resource configuration files. This needs to be deprecated, and replaced with a conditional configuration based on "class has been reached at run time". The information which code has already been executed at run time is stable between layered images and standalone images, i.e., even layered images include more elements than necessary at image build time, the run-time behavior is still the same as a standalone image.

We also need to deprecate (or at least discourage using) the parts of the `Feature` mechanism where users can write code that runs at image build time and reacts to classes, methods, or fields becoming reachable.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[GR-48103] Native Image Layers #7626

TL;DR

Motivation

Goals

Non-Goals

Success Metrics

Technical Details

Terminology

Workflow

Programming model

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[GR-48103] Native Image Layers #7626

Description

TL;DR

Motivation

Goals

Non-Goals

Success Metrics

Technical Details

Terminology

Workflow

Programming model

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions