Nature Communications. 2023 Nov 21;14:7563. doi: 10.1038/s41467-023-42068-x

Unifying the design space and optimizing linear and nonlinear truss metamaterials by generative modeling

Li Zheng 1, Konstantinos Karapiperis 1, Siddhant Kumar 2, Dennis M Kochmann 1
PMCID: PMC10663604  PMID: 37989748

Abstract

The rise of machine learning has fueled the discovery of new materials and, especially, metamaterials—truss lattices being their most prominent class. While their tailorable properties have been explored extensively, the design of truss-based metamaterials has remained highly limited and often heuristic, due to the vast, discrete design space and the lack of a comprehensive parameterization. We here present a graph-based deep learning generative framework, which combines a variational autoencoder and a property predictor, to construct a reduced, continuous latent representation covering an enormous range of trusses. This unified latent space allows for the fast generation of new designs through simple operations (e.g., traversing the latent space or interpolating between structures). We further demonstrate an optimization framework for the inverse design of trusses with customized mechanical properties in both the linear and nonlinear regimes, including designs exhibiting exceptionally stiff, auxetic, pentamode-like, and tailored nonlinear behaviors. This generative model can predict manufacturable (and counter-intuitive) designs with extreme target properties beyond the training domain.

Subject terms: Mechanical engineering, Mechanical properties, Engineering


Truss metamaterials are ubiquitous but their vast design space is far from fully explored. Here, authors use machine learning to present a unified, continuous latent space description, enabling the rapid generation of trusses with tunable or exceptional linear and nonlinear mechanical properties.

Introduction

Architected metamaterials are rapidly redefining the boundaries of achievable material properties. Supported by additive manufacturing, the design of such cellular solids with tailored microstructural architecture has led to unprecedented functionality: from counter-intuitive negative compressibility1,2 and negative Poisson’s ratio3,4 to mechanical cloaking5, extreme energy absorption6–8, and guided acoustic waves9,10. Among the myriad of available design spaces, truss metamaterials (based on periodic lattices of beam networks) have emerged as the dominant one, particularly due to their high stiffness and strength in the ultralow-relative-density regime11–16 and their simple manufacturability.

Truss metamaterials offer an extensively tunable design space based on both the lattice topology (i.e., the connectivity of the beam network) and geometric features (e.g., the length, orientation, and cross-sectional shape of each strut). However, most of this vast design freedom has remained untapped. Many design applications17–21 have been limited to a small catalog of ad-hoc lattices (e.g., kagome, octet, and octahedron22–28), which have been identified through a combination of intuition and trial-and-error over the years. While the catalog-based search space can be enriched by tuning geometric features29 or base material properties19, it is strongly limited in topological tunability and fails to exploit the full range of achievable designs and hence of achievable effective metamaterial properties. Many truss optimization solutions have adopted heuristic search strategies to find optimal structures by iteratively adjusting the active beams and/or nodes in the design domain, according to mechanics-based criteria30–33. This, however, becomes computationally infeasible in large-scale problems due to the enormous and noisy search space. Recently, Lumpe and Stankovic34 proposed an extensive catalog of truss lattices by mimicking the molecular structure of crystalline lattices. Yet, the same fundamental issue persists as for all such catalogs of truss lattices with different topologies: there exists no finite-dimensional, continuous, and seamless design space. For example, while the distinction between a “kagome” and an “octet” lattice is interpretable by humans, it is not directly interpretable by a computer. The question is hence: how does one translate distinct lattice topologies into a unified, finite-dimensional, vector-based parameterization that can be understood by an algorithm aiming to optimize the lattice design for certain target metamaterial properties? 
While a pixeled/voxelated image-based parameterization (similar to conventional topology optimization) is a solution in principle, capturing slender beams in truss lattices warrants extremely high resolution, which again becomes prohibitively expensive.

To address the aforementioned limitations, we introduce a graph-theoretic approach to represent a vast design space of three-dimensional (3D) truss topologies. Every truss lattice can be naturally translated into a graph—a mathematical structure consisting of edges and nodes (i.e., struts and their intersections, respectively). The edges encode the lattice topology in the form of a nodal adjacency matrix; the nodes encode the geometric features in the form of spatial coordinates. Additional graph-level information may include, e.g., strut thickness or further cross-sectional information. While a graph as a data structure is computationally interpretable, the discontinuities across different lattice topologies are also persistent in the graph representation.

We introduce a machine learning (ML) framework to extract a generalizable and unified design space for truss lattices with diverse topologies. ML has made a significant impact in the design of metamaterials, from data-driven surrogate models for accelerating multiscale simulations35–40 to the inverse design for tailored linear41–47 and nonlinear48–51 properties. Of particular interest to our context are generative ML models (using, e.g., variational autoencoders52 (VAEs) and generative adversarial networks53 (GANs)), which aim to learn the underlying distribution of the data itself (as opposed to discriminative models that learn to predict labels for a given input) and have been used to successfully design metamaterials21,54–56. However, unlike in those approaches where the design parameterization can be formulated as a finite-dimensional vector or image, we turn to the special class of generative graph-based ML models for dealing with graph representations of truss lattices. Graph-based learning has recently gained prominence because of its ability to model non-Euclidean data representing interrelations in irregular domains, such as social networks57–59, chemical molecules60,61, and material microstructures37,62. Existing works utilize supervised graph-based models as surrogate models to provide real-time predictions of various properties of interest, e.g., homogenized elastic63 and thermal properties64 or the dominant deformation mechanism65 of lattice architectures. By contrast, the goal of the graph generative modeling framework proposed here is to construct a unified, continuous latent representation of a vast and discrete truss design space and to exploit it for the inverse design of both linear and nonlinear target mechanical properties. We here demonstrate that a VAE can successfully abstract a hidden or latent design representation of diverse graph-based truss lattices. 
This is achieved by using a neural network architecture, which contains an informational bottleneck and enables compressing the high-dimensional graph representation into a finite, low-dimensional, and smooth vector representation. In this latent space, any two lattices with similar topological and geometric features are located close to each other, whereas any two distant lattices can be continuously transformed into each other. New lattice designs can be straightforwardly generated by randomly sampling in the latent space. Exploration of this latent space further allows us to seamlessly search or optimize for truss lattices with exotic or tailored properties—including those that lie outside the domain of the available training data.

Results

Creating the design space

We begin by introducing our definition of the design space of truss lattices. From a practical standpoint, we focus our attention on lattices based on the periodic tessellation of a cubic representative volume element (RVE). Inspired by the cube decomposition approach66, we partition the RVE into eight equal cubes, the octants, as shown in Fig. 1a. By assuming symmetries across the three mutually orthogonal symmetry planes, it is sufficient to define the truss only within a single octant, which simplifies the complex optimization problem while ensuring great flexibility and periodic tilability. Following the graph representation of molecules63,67, truss-like structures can be described by a set of nodes connected by solid beams, which form the nodes and edges of the graph, respectively. To create a sufficiently large design space of truss structures, we define a total of 27 possible node placements within the octant (Fig. 1a): 8 vertex nodes {v0, v1, v2, v3, v4, v5, v6, v7}, 12 edge nodes {e0, e1, e2, e3, e4, e5, e6, e7, e8, e9, e10, e11}, 6 face nodes {f0, f1, f2, f3, f4, f5}, and a single body node {t0} within the volume. While the body node is free to move in 3D, edge and face nodes are restricted to be displaced only along the edge and within the face, respectively, and vertex nodes are fixed, as illustrated in Fig. 1a. We define the offsets of nodes (as in ref. 66) in the natural coordinate system, representing their relative positions with respect to the fixed vertex nodes to ensure connectivity on the outer boundaries. The complete set of node features x contains the offset of each node in its movable direction(s).
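The node count and the degrees of freedom per node type can be checked with a short sketch. It assumes, purely for illustration, that the reference node positions lie on a 3 × 3 × 3 grid of a unit octant (coordinates 0, 0.5, and 1); the grid, the function name `classify_nodes`, and the type labels are hypothetical and not taken from the authors' implementation.

```python
from collections import Counter
from itertools import product

# Assumption: reference positions sit on a 3 x 3 x 3 grid of the unit octant.
GRID = (0.0, 0.5, 1.0)
NODE_TYPE = {0: "vertex", 1: "edge", 2: "face", 3: "body"}


def classify_nodes():
    """Return a list of (position, node type, degrees of freedom)."""
    nodes = []
    for pos in product(GRID, repeat=3):
        # A coordinate strictly inside (0, 1) may be offset; a coordinate on
        # the octant boundary is pinned so the node stays on its edge or face.
        dof = sum(1 for c in pos if 0.0 < c < 1.0)
        nodes.append((pos, NODE_TYPE[dof], dof))
    return nodes


nodes = classify_nodes()
counts = Counter(kind for _, kind, _ in nodes)
print(len(nodes), dict(counts))
```

Under this assumption, the number of free coordinates alone determines the node type, which mirrors the rule that vertex nodes are fixed, edge nodes slide along an edge, face nodes move within a face, and the body node moves freely in 3D.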

Fig. 1. Overview of the truss parametrization and the data generation scheme used to create a diverse truss lattice dataset.


a Cube decomposition generates the irreducible truss pattern with possible node placements and the nodes' degrees of freedom defined on the octant. For example, an edge node has one degree of freedom and is only allowed to traverse along the edge, while a face node with two degrees of freedom can assume any position within the plane. b Graph representation of an octet truss example (vertex nodes in blue, face node in purple), whose vertices and beams serve as input to the variational autoencoder (VAE) model. The lattice is defined by the adjacency matrix and node features. c Examples of different truss lattices, realized by varying the topology and the vertex degrees of freedom as well as the strut diameters, showcase the wide coverage of the design space. d Effective directional Young’s moduli E and effective shear moduli G (normalized by the respective properties Es and Gs of the base material) in the three main directions and their projected values on the e1e2-, e1e3-, and e2e3-planes of 3000 structures randomly drawn from the dataset, and selected examples with extreme properties. The size of the markers is proportional to the strut radius of the unit cell; their colors indicate the effective bulk modulus. The effective stiffness was obtained by finite element (FE) homogenization with periodic boundary conditions; the radius of the circular cylindrical struts is scaled to maintain a constant relative density of ρ = 0.15. The resulting variety of truss configurations in the dataset covers a large range of elastic properties.

Analogous to the above node features, the structural features of the unit cell include the truss topology, which is fully described by a list of all edges (e.g., (e0, f1) represents a beam connecting edge node e0 and face node f1). By analogy with a graph structure, we represent the lattice topology using an adjacency matrix $A \in \{0, 1\}^{n \times n}$, where the diagonal elements $A_{ii} = 1$ (for all i = 1, …, n) and n = 27 is the total number of nodes. The adjacency matrix serves as a lookup table, where the value of 1 denotes an edge between two nodes, whereas 0 indicates that no edge is present. With the graph representation, introducing or removing beams from the truss can be easily achieved by operations on the adjacency matrix (e.g., the superposition of two structures is described by the logical disjunction, i.e., element-wise boolean OR, of two adjacency matrices). Possible defects, such as isolated nodes or struts, can be efficiently identified and resolved by examining the adjacency matrix. Here we do not explicitly consider permutation invariance or equivariance of graphs due to the inherent representation complexity in generative modeling tasks68,69. In other scenarios (such as predicting frame-indifferent properties37,70), incorporating permutation invariance or symmetry groups such as SE(3) in graph representations71–74 could largely enhance the learning of the underlying relations of various structural configurations.
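The adjacency-matrix operations described above can be sketched in a few lines. This is a minimal illustration using nested lists; the helper names and the node indices 0, 1, 2 are hypothetical, and treating a node as unused when its row has no off-diagonal entry is an assumption about how defects might be detected.

```python
def empty_adjacency(n=27):
    """n x n adjacency matrix with ones on the diagonal (A_ii = 1)."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def add_beam(A, i, j):
    """Insert an undirected beam between nodes i and j (keeps A symmetric)."""
    A[i][j] = 1
    A[j][i] = 1


def superpose(A, B):
    """Superposition of two lattices: element-wise boolean OR."""
    return [[a | b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def nodes_without_beams(A):
    """Indices of nodes whose row has no off-diagonal entry."""
    return [i for i, row in enumerate(A)
            if not any(v for j, v in enumerate(row) if j != i)]


A = empty_adjacency()
add_beam(A, 0, 1)          # hypothetical beam between nodes 0 and 1
B = empty_adjacency()
add_beam(B, 1, 2)          # hypothetical beam between nodes 1 and 2
C = superpose(A, B)        # contains both beams
```

Because the OR of two symmetric matrices is symmetric, the superposed lattice remains a valid undirected graph without further bookkeeping.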

We leverage the above representation to construct a dataset containing a large family of truss lattices covering a wide range of mechanical properties. To this end, we begin with a set of three well-known elementary trusses as initial topologies, including the octet, body-centered-cubic, and simple cubic unit cells as 1 × 1 × 1 and 2 × 2 × 2 tessellations. Starting from those, an iterative stochastic perturbation algorithm generates novel structures by randomly inserting/removing both nodes (of edge, vertex, face, or body types) and truss connectivities. New connections are created by connecting newly inserted nodes to at least one of their nearest neighbor nodes. Random perturbations are added to the position of all nodes, while obeying the corresponding positional constraints of the vertex, edge, and face nodes. The above procedure is repeated until the obtained truss lattice satisfies the constraints on the graph connectivity (details provided in Supplementary Note 1). We build a large preliminary library, which includes a variety of trusses—from well-studied structures to unconventional ones, as shown in Fig. 1c. From the thus-obtained preliminary library, new lattices are created by superimposing two randomly sampled structures (with repetitions allowed). To ensure the physical feasibility of structures generated by random perturbations and superpositions, we enforce the constraint that the lattice is self-connected; i.e., the truss graph must have exactly one maximally connected subgraph spanning the whole graph. Intersecting beams, which may arise from the superposition of different topologies, are fixed by splitting the involved beams and inserting a new node at the intersection point. The resulting lattices constitute the design space as well as the corresponding dataset for ML.
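The self-connectivity constraint can be illustrated with a breadth-first search over the beam list. This is a sketch rather than the procedure from Supplementary Note 1: the function names are hypothetical, and only nodes that carry at least one beam are considered part of the truss graph.

```python
from collections import deque


def connected_components(beams):
    """Group the nodes that appear in `beams` (pairs of node indices)
    into connected components using breadth-first search."""
    neighbours = {}
    for i, j in beams:
        neighbours.setdefault(i, set()).add(j)
        neighbours.setdefault(j, set()).add(i)

    seen, components = set(), []
    for start in sorted(neighbours):
        if start in seen:
            continue
        component, queue = {start}, deque([start])
        seen.add(start)
        while queue:
            node = queue.popleft()
            for nb in neighbours[node]:
                if nb not in seen:
                    seen.add(nb)
                    component.add(nb)
                    queue.append(nb)
        components.append(component)
    return components


def is_self_connected(beams):
    """True if all beams belong to exactly one connected component."""
    return len(connected_components(beams)) == 1


print(is_self_connected([(0, 1), (1, 2)]))   # one chain of beams
print(is_self_connected([(0, 1), (2, 3)]))   # two separate beams
```

A lattice produced by random perturbation or superposition would be rejected (or repaired) whenever this check fails.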

By perturbing both the topological and geometrical features of the lattice, we create a rich database of anisotropic lattices, consisting of 965,736 unique structures. As a representative and important mechanical property, we study the full anisotropic 3D elastic stiffness tensor. All lattices obtained from the cube decomposition approach naturally possess three orthogonal plane reflection symmetries and therefore only require nine independent components to describe the orthotropic homogenized stiffness tensor; we select $S = (C_{1111}, C_{1122}, C_{1133}, C_{2222}, C_{2233}, C_{3333}, C_{2323}, C_{3131}, C_{1212})$. For each structure, the effective mechanical stiffness tensor is computed by homogenization, using a finite element (FE) framework, which models individual struts as linear elastic Timoshenko beams with a circular cross-section75. We assume a homogeneous base material with Poisson’s ratio $\nu_s = 0.3$ and unit Young’s modulus $E_s = 1$ (i.e., all reported effective stiffness values are relative to the base material’s Young’s modulus). The beam thickness d of all struts is varied such that a constant relative density ρ = 0.15 is maintained across all structures. This will be helpful during property optimization, as it ensures that optimized mechanical properties do not come at the cost of an increased weight.
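To make the nine-component description concrete, the following sketch assembles the symmetric 6 × 6 orthotropic stiffness matrix from S. It assumes the common Voigt index order 11, 22, 33, 23, 31, 12; the function name and the numerical values are placeholders, not data from the paper.

```python
# Voigt positions (row, column) of the nine components, in the order
# S = (C1111, C1122, C1133, C2222, C2233, C3333, C2323, C3131, C1212).
# Assumption: Voigt index order 11, 22, 33, 23, 31, 12.
VOIGT_POSITIONS = (
    (0, 0), (0, 1), (0, 2),
    (1, 1), (1, 2),
    (2, 2),
    (3, 3), (4, 4), (5, 5),
)


def orthotropic_stiffness_matrix(S):
    """Build the symmetric 6 x 6 Voigt matrix from the nine components."""
    if len(S) != 9:
        raise ValueError("expected nine independent stiffness components")
    C = [[0.0] * 6 for _ in range(6)]
    for value, (i, j) in zip(S, VOIGT_POSITIONS):
        C[i][j] = value
        C[j][i] = value  # major symmetry of the stiffness tensor
    return C


# Placeholder values, only to show the layout of the matrix.
C = orthotropic_stiffness_matrix([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0])
for row in C:
    print(row)
```

All entries coupling normal and shear components remain zero, which is exactly what the three orthogonal reflection symmetries imply.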

To visualize the property range reached by the established truss catalog, Fig. 1d shows the effective directional Young’s moduli E and the effective shear moduli G along the three principal cubic directions and their projections onto the x-y-, x-z-, and y-z-planes. Results show that the established truss database covers a wide range of Young’s moduli, spanning three orders of magnitude between $10^{-5}$ and $10^{-2}$ times the base material’s Young’s modulus. Although the initial structures used as seeds are limited to cubic symmetry, the resulting library generated by perturbing both the truss connectivity and node positions exhibits strong anisotropy and has significantly expanded the range of mechanical properties (e.g., the representative examples shown in Fig. 1d reach effective Young’s moduli $E_{33}$ ca. 36% higher than that of a simple cubic unit cell in the principal direction at the same density). Of course, the dataset could be enriched by more unique structures, using the above approach. Yet, we limit our study to the current dataset based on the performance and computational cost of the ML model (as detailed below). The data generation could also be generalized to other truss families. For example, while we only consider centrosymmetric unit cells, non-centrosymmetric unit cells can be readily constructed in a similar manner by allowing for different structures in each octant instead of applying symmetries. Compared to prior approaches that have focused on a truss unit cell catalog, our database achieves a significantly wider design space of truss structures with a relatively compact formulation based on graphs.

Generative modeling framework

The proposed design space for truss lattices is discrete and discontinuous. For example, any two truss lattices may have different numbers of nodes or nodes with different numbers of degrees of freedom (such as edge vs. face nodes). However, representing the truss lattices by a graph structure, as described above, enables the use of the node features and the adjacency matrix as inputs for an ML model to learn a low-dimensional, continuous, and smooth representation of the high-dimensional, discrete, and intractable graph representation.

We use a VAE containing two neural networks, an encoder and a decoder (see Fig. 2a for a schematic of the framework). Let G = (A, x) denote the graph representation of a truss lattice, defined by its adjacency matrix and node features. The encoder Qϕ (with the set of trainable parameters ϕ) maps an input graph G into two d-dimensional vectors $\mu(G;\phi) \in \mathbb{R}^d$ and $\sigma(G;\phi) \in \mathbb{R}^d$, which, respectively, prescribe the mean and the (diagonal) covariance of a diagonal multivariate Gaussian distribution

$$z \sim \mathcal{N}\left([\mu_1, \dots, \mu_d],\ \operatorname{diag}(\sigma_1^2, \dots, \sigma_d^2)\right). \tag{1}$$

Here, $z \in \mathbb{R}^d$ denotes a low-dimensional vector encoding of the input graph G, also known as the latent representation. While the formulation is presented in terms of σ, the neural network predicts $\log\sigma$. To maintain differentiability (required for backpropagation-based neural network training), z is sampled using the reparameterization trick52 as

$$z = \mu + \varepsilon \odot [\sigma_1, \dots, \sigma_d] \quad \text{with} \quad \varepsilon \sim \mathcal{N}(0, I), \tag{2}$$

where ⊙ denotes element-wise multiplication. The decoder Pθ (with the set of trainable parameters θ) maps the latent vector z into a graph representation $\hat{G} = (\hat{A}, \hat{x}) = P_\theta(z;\theta)$ and attempts to accurately reconstruct the original graph, i.e., $\hat{G} \approx G$. The autoencoding of the input graphs with such a neural network structure creates an information bottleneck76 in the latent representation. The information bottleneck only preserves the necessary meaningful information to allow accurate reconstruction of the graphs, with a significant reduction in the dimension and complexity of the original data. Consequently, in the latent space any two graphs/lattices with similar topology and geometry are located close to each other; any two distant graphs/lattices can be continuously transformed into each other by traversing the latent space.
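The sampling step of the reparameterization trick can be sketched without any ML library. The function below takes the mean and the logarithm of the standard deviation (since the network predicts log σ) and returns one sample z; the function name and the optional `rng` argument are illustrative assumptions. In an actual training loop the same expression would be evaluated inside an automatic-differentiation framework so that gradients flow through μ and log σ.

```python
import math
import random


def reparameterize(mu, log_sigma, rng=random):
    """Draw z = mu + eps * sigma with eps ~ N(0, I), element by element.

    mu, log_sigma: sequences of equal length d.
    rng: any object with a gauss(mean, std) method (assumption for testing).
    """
    return [m + rng.gauss(0.0, 1.0) * math.exp(ls)
            for m, ls in zip(mu, log_sigma)]


rng = random.Random(0)                      # fixed seed for reproducibility
z = reparameterize([0.0, 1.0, -1.0], [0.0, -1.0, -2.0], rng)
print(len(z))
```

All randomness is isolated in ε, so z is a deterministic and differentiable function of μ and σ for a fixed draw of ε.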

Fig. 2. Generative modeling framework.


a The variational autoencoder (VAE) model takes the graph representation G = (A, x) of truss lattices (defined by adjacency matrix A and node features x) as input to the encoder Qϕ and learns a continuous latent space over the geometries of various trusses. μ and σ denote the mean and covariance of the multivariate Gaussian distribution. The reduced representation z of truss structures is then passed to the decoder Pθ to reconstruct the lattice. An augmented multi-layer perceptron (MLP) neural network Fω predicts the mechanical properties of trusses based on their latent representation z. b The inverse design framework aims to generate truss lattices with target properties. Starting with the 100 closest matches in the training dataset as initial guesses, gradient-based optimization is applied to search for possible lattices with desirable properties in the latent space (N(0,I) denotes a multivariate Gaussian distribution with zero mean and the identity as the covariance matrix). The inverse design candidate structures are passed to the encoder to obtain their corresponding latent representations, which are then forwarded to the property predictor to predict the effective stiffness of the proposed lattices.

We here adopt the attributed network embedding method77 to learn the individual dependencies of the structural topology and node placements as well as their combined effects (details provided in Supplementary Note 2.2). Specifically, the adjacency matrix (containing binary values) and node features (containing continuous values) are serialized and passed through separate encoders to obtain the respective latent space distribution means $\mu^A \in \mathbb{R}^{d_A}$ and $\mu^x \in \mathbb{R}^{d_x}$ and standard deviations $\sigma^A \in \mathbb{R}^{d_A}$ and $\sigma^x \in \mathbb{R}^{d_x}$. The embedding dimensions $d_A$ and $d_x$ are chosen such that $d_{Ax} = (d_A + d_x) - d > 0$, where $d_{Ax}$ is the number of latent dimensions shared by both embeddings. Note that, since the adjacency matrix is symmetric, only the upper triangular part is considered by the encoder. The final latent space distribution is obtained by partial overlap of the adjacency matrix and node feature embeddings. The mean is given by

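$$\mu = \underbrace{[\mu^A_1, \dots, \mu^A_{d_A - d_{Ax}}]}_{\text{topology-specific}} \oplus \underbrace{\tfrac{1}{2}\left([\mu^A_{d_A - d_{Ax}+1}, \dots, \mu^A_{d_A}] + [\mu^x_1, \dots, \mu^x_{d_{Ax}}]\right)}_{\text{shared topology and geometry}} \oplus \underbrace{[\mu^x_{d_{Ax}+1}, \dots, \mu^x_{d_x}]}_{\text{geometry-specific}}, \tag{3}$$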

where ⊕ denotes vector concatenation (the logarithm of the standard deviation, i.e., $\log\sigma$, is obtained analogously using $\log\sigma^A$ and $\log\sigma^x$). Since the adjacency matrix and node features influence the topology and geometry, respectively, subsets of the latent space dimensions offer topology-specific, geometry-specific, and shared control over the design space, the advantages of which will become apparent when discussing the results. Similar to the encoding, two separate decoders are used to output the graph $\hat{G} = (\hat{A}, \hat{x})$ from a latent vector z: the topology-specific and shared dimensions of z are used to obtain the adjacency matrix $\hat{A}$; the shared and geometry-specific dimensions of z are used to obtain the node features $\hat{x}$.
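The partial overlap of the two embeddings and the corresponding split for the two decoders can be sketched as follows. The function names and the small example dimensions are hypothetical; the actual values of the embedding dimensions are hyperparameters reported by the authors in the Supplementary Information and are not reproduced here.

```python
def overlap_embedding(mu_A, mu_x, d):
    """Combine topology and geometry embeddings into one d-dimensional mean.

    The last d_Ax entries of mu_A and the first d_Ax entries of mu_x are
    averaged to form the shared block, with d_Ax = len(mu_A) + len(mu_x) - d.
    """
    d_A, d_x = len(mu_A), len(mu_x)
    d_Ax = d_A + d_x - d
    if not 0 < d_Ax <= min(d_A, d_x):
        raise ValueError("dimensions must give a positive, feasible overlap")
    topology = list(mu_A[:d_A - d_Ax])
    shared = [0.5 * (a + x) for a, x in zip(mu_A[d_A - d_Ax:], mu_x[:d_Ax])]
    geometry = list(mu_x[d_Ax:])
    return topology + shared + geometry


def split_latent(z, d_A, d_x):
    """Inputs of the two decoders: (topology + shared, shared + geometry)."""
    return z[:d_A], z[len(z) - d_x:]


# Hypothetical sizes: d_A = 4, d_x = 3, d = 5, hence d_Ax = 2.
mu = overlap_embedding([1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0], d=5)
z_topology, z_geometry = split_latent(mu, d_A=4, d_x=3)
print(mu, z_topology, z_geometry)
```

Changing how many dimensions fall into each block shifts the weight given to topology, geometry, or their interaction without changing the total latent dimension d.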

Towards the efficient data-driven design and the discovery of new structures with desirable properties, the latent space can be associated with specific properties that we seek to optimize by a neural network surrogate model21,78 using the latent vectors as input, thus bypassing the costly FE homogenization computation. Therefore, we adapt the original VAE structure and link the latent space to the homogenized effective stiffness measures S by feeding the mean of the latent vector, i.e., μ(G; ϕ), into an additional neural network-based property predictor Fω (with trainable parameters ω).

Given a representative dataset $\mathcal{D} = \{(G^{(n)}, S^{(n)}) : n = 1, \dots, N\}$ containing N structure-property pairs, the VAE and property predictor are jointly trained as

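$$\theta, \phi, \omega \leftarrow \arg\min_{\theta,\phi,\omega}\ \underbrace{\frac{1}{N}\sum_{n=1}^{N}\left(\left\|A^{(n)} - \hat{A}^{(n)}\right\|^2 + \left\|x^{(n)} - \hat{x}^{(n)}\right\|^2\right)}_{\text{reconstruction loss}} + \underbrace{\frac{1}{N}\sum_{n=1}^{N}\left\|S^{(n)} - F_\omega[\mu^{(n)}]\right\|^2}_{\text{property prediction loss}} + \underbrace{\sum_{n=1}^{N} D_{\mathrm{KL}}\left(\mathcal{N}\left([\mu_1^{(n)}, \dots, \mu_d^{(n)}],\ \operatorname{diag}\left((\sigma_1^{(n)})^2, \dots, (\sigma_d^{(n)})^2\right)\right)\,\Big\|\,\mathcal{N}(0, I)\right)}_{\text{Kullback-Leibler divergence}}. \tag{4}$$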

The reconstruction loss enforces that the encoded graphs (equivalently, truss lattices) are accurately reconstructed (in terms of both the adjacency matrix and node features) by the decoder. The property prediction loss enforces that the property predictor outputs the stiffness of a truss lattice accurately. The Kullback-Leibler divergence (KLD)52 penalizes the divergence of the probability distribution of the latent space produced by the encoder from the standard Gaussian distribution N(0,I). This allows directly sampling the latent space using a standard Gaussian distribution and decoding truss lattices during the inference stage (as opposed to first encoding an a-priori known lattice into a latent vector and then decoding back during training), which in turn enables the design and discovery of novel trusses beyond the dataset at hand. The KLD loss further simplifies to

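$$D_{\mathrm{KL}}\left(\mathcal{N}\left([\mu_1, \dots, \mu_d],\ \operatorname{diag}(\sigma_1^2, \dots, \sigma_d^2)\right)\,\Big\|\,\mathcal{N}(0, I)\right) = \frac{1}{2}\sum_{j=1}^{d}\left(\sigma_j^2 + \mu_j^2 - 1 - \log\sigma_j^2\right). \tag{5}$$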

For detailed derivations of the reconstruction and KLD losses, see ref. 52. All details pertaining to the neural network architectures, training protocols, and hyperparameters are presented in Supplementary Table 1.
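As a worked illustration of the closed-form KLD and of how the three loss terms combine, the sketch below evaluates them for plain Python lists. It is not the authors' training code: the dictionary keys, the function names, and the unweighted sum of the terms are assumptions made for illustration, and a real implementation would use an automatic-differentiation framework.

```python
import math


def kl_to_standard_normal(mu, log_sigma):
    """Closed-form KL divergence between N(mu, diag(sigma^2)) and N(0, I)."""
    total = 0.0
    for m, ls in zip(mu, log_sigma):
        variance = math.exp(2.0 * ls)          # sigma_j^2
        total += variance + m * m - 1.0 - 2.0 * ls   # log(sigma_j^2) = 2 log(sigma_j)
    return 0.5 * total


def squared_error(a, b):
    """Squared Euclidean distance between two flattened arrays."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def joint_loss(batch):
    """Reconstruction + property prediction + KL terms for a batch.

    Each sample is a dict with flattened lists under the (hypothetical) keys
    A, A_rec, x, x_rec, S, S_pred, mu, log_sigma.
    """
    n = len(batch)
    reconstruction = sum(squared_error(s["A"], s["A_rec"])
                         + squared_error(s["x"], s["x_rec"]) for s in batch) / n
    prediction = sum(squared_error(s["S"], s["S_pred"]) for s in batch) / n
    kl = sum(kl_to_standard_normal(s["mu"], s["log_sigma"]) for s in batch)
    return reconstruction + prediction + kl


print(kl_to_standard_normal([0.0, 0.0], [0.0, 0.0]))
```

The KLD vanishes only when the encoder output coincides with the standard Gaussian (μ = 0, σ = 1) and grows as the encoded distribution drifts away from it.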

The generative capability of the VAE enables us to explore novel yet realistic truss structures, whose mechanical properties are immediately available at minimal computational cost through the property predictor Fω. With the joint property predictor as a regularizer, the generative modeling framework helps yield a deeper understanding of the latent space, which otherwise lacks physical interpretation. It also presents new opportunities for various downstream tasks by modifying the target of the structure-property predictor, e.g., towards the classification of deformation mechanisms of truss lattices65, or the prediction of dispersion relations79,80 and the nonlinear response50,81. Furthermore, our framework can be expanded to the simultaneous design of multiple properties by feeding the extracted features to a multi-task property predictor82–84. By leveraging the correlations and shared information among different targets, we can effectively guide the design of truss lattices that have various desired properties by integrating the multi-task property predictor into a multi-objective optimization framework.

Performance of the VAE model

Our first goal is to correctly reconstruct truss structures: any given input lattice is mapped by the encoder into the latent space, from where the decoder reconstructs the truss lattice (Fig. 2a). Defining the topology reconstruction accuracy as the percentage of correctly predicted links reveals that the trained VAE model accurately captures the topological features of trusses with an accuracy score of 99.9% for the adjacency matrix. The correlation plot between the true and reconstructed node positions is presented in Supplementary Fig. 6a. The model shows high quality in the reconstruction of the geometrical features, demonstrated by $R^2$ ≥ 99.9% across the 3D components (x, y, z) of the node positions. A comparison of representative reconstructed truss structures and the corresponding original structures from the test dataset is shown in Supplementary Fig. 6b.

Next, we assess the performance of the surrogate model for predicting the 3D effective stiffness measures S of trusses on an independent test set. As shown in the correlation plots between the true and predicted stiffness components in Supplementary Fig. 7, the trained model Fω achieves $R^2$ ≥ 98.2% across all stiffness components. Altogether, this confirms that our VAE model accurately reconstructs truss structures and predicts their effective stiffness properties.
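The two kinds of metrics used above can be sketched as follows. The topology accuracy is interpreted here as the fraction of matching entries in the upper triangle of the adjacency matrix, which is an assumption about the exact definition; the coefficient of determination follows its standard formula. Function names and the toy data are hypothetical.

```python
def topology_accuracy(A_true, A_pred):
    """Fraction of node pairs (i < j) whose link state is predicted correctly."""
    n = len(A_true)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    correct = sum(1 for i, j in pairs if A_true[i][j] == A_pred[i][j])
    return correct / len(pairs)


def r2_score(y_true, y_pred):
    """Coefficient of determination R^2 = 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Toy example with three nodes: one of the three possible links is wrong.
A_true = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]
A_pred = [[1, 1, 0], [1, 1, 0], [0, 0, 1]]
print(topology_accuracy(A_true, A_pred))
print(r2_score([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]))
```

Only the upper triangle is compared because the adjacency matrix is symmetric and its diagonal is fixed to one.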

With the jointly trained property predictor, the latent space is better organized in the sense that structures with similar mechanical properties are expected to cluster in the same region within the latent space (see also Supplementary Note 3.1), which gives important insight into the originally high-dimensional and intractable design space. Moreover, the property predictor works as an additional constraint, enforcing that points in the latent space should decode into valid and realistic truss structures, thus preserving some mechanical property information while reducing the dimensionality. We evaluate the quality and efficiency of the latent space generation by randomly sampling 1000 points from the latent space and using the decoder to reconstruct the corresponding structures. Results show that on average 82.3% (evaluated on 1000 attempts of random sampling) of randomly selected samples can be successfully decoded into valid (i.e., physically meaningful) truss topologies; we refer to this fraction as the validity score. While it is appealing to improve the quality of samples by imposing stronger regularization, such as increasing the weight of the KLD term in Equation (4) (also known as β-VAE85), the fidelity of the reconstruction will degrade due to the inherent trade-off between reconstruction accuracy and random sample quality in VAE models. In this work, we therefore adopt an annealing schedule for the weight of the KL-divergence term86 (details provided in Supplementary Note 2.1) to dynamically tune the importance given to the regularization and reconstruction losses, thus ensuring the flexibility of the VAE reconstruction.

Exploration in the latent space of truss lattices

The continuous and low-dimensional latent space with generalization ability is particularly advantageous for the design of new structures by traversing the latent space through simple arithmetic operations of the latent representation z. While existing works that relied on a pixel/voxel-based parameterization have shown success in mapping the topology and mechanical properties in a latent space with a similar data-driven design framework21, they did not consider the impact of the different types of structural features (i.e., of connectivity and node positions) separately. In fact, manipulating the truss connectivity vs. moving nodes will expand the property space differently. For example, changes to the truss topology can have a strong effect on its deformation behavior (stretch- or bending-dominance depends primarily on the connectivity87). To this end, we adopt the joint embedding model (details provided in Supplementary Note 2.2) to encode the topological and geometrical features in different dimensions of z, while maintaining the total number of latent dimensions constant. A major advantage of this adjustment is that the importance given to each type of information can be adjusted by tuning the corresponding number of latent dimensions without increasing the model complexity. This provides flexibility and allows us to extract information that is only related to the topology or to the node position or to their interactions. As a consequence, traversals through the latent space along different axes give rise to significantly different changes in mechanical properties, which is enabled by the attributed latent embedding approach; i.e., each axis (each component of z) stores specific information about the structural pattern transformation. 
Figure 3 shows an example of moving along three different latent axes (see also Supplementary Information Movies 1–3), which encode the information specifically for connectivity reconstruction, node positions reconstruction, or both—in each case starting from the same truss—which leads to the illustrated changes in structural topology, geometry, or both and the corresponding 3D stiffness (visualized as elastic surfaces).
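A traversal along a single latent axis amounts to shifting one component of z while keeping all others fixed. The sketch below generates such a path; the function name, the latent dimension, the axis index, and the step sizes are hypothetical, and each point would still have to be passed through the trained decoder to obtain a truss.

```python
def traverse_axis(z, axis, offsets):
    """Return copies of z in which component `axis` is shifted by each offset."""
    path = []
    for t in offsets:
        point = list(z)      # copy so the starting point stays unchanged
        point[axis] += t
        path.append(point)
    return path


z_start = [0.0, 0.0, 0.0, 0.0]                      # hypothetical latent vector
path = traverse_axis(z_start, axis=2, offsets=[-1.0, -0.5, 0.0, 0.5, 1.0])
for point in path:
    print(point)
```

Whether the decoded trusses change in topology, geometry, or both then depends on which block of the latent vector the chosen axis belongs to.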

Fig. 3. Representative examples of interpolation in the latent space.


Samples are generated by traversals along three different latent axes: a taken from the topology-specific, b shared topology and geometry, and c geometry-specific dimensions of the latent space. Their corresponding 3D elastic surface evolution (obtained by finite element homogenization) is shown along the interpolation path, which indicates the effective directional Young’s modulus E(d) normalized by the Young’s modulus Es of base material.

While previous work has investigated the generation of new structures by moving along a path in latent space21,78, it is usually intractable to obtain a disentangled representation of the original data space, since the complex correlation between entities is non-trivial to decompose. By contrast, our model uses a systematic latent representation for trusses, which admits human interpretation and where the truss connectivity and node positions can be independently altered. This is useful for many downstream tasks; e.g., for identifying the roles of different geometrical features and their impact on the effective truss performance.

As an illustration, we define an interpolation path between two points in latent space and reconstruct a continuous family of new trusses along the path with the decoder. The high validity score of our latent space ensures that the majority of generated new samples are physically feasible. (Possible issues such as a lack of connectivity or structural instability can be resolved by a light post-processing step.) For example, let us assume that two points in the high-dimensional latent space lie on the surface of a hypersphere rather than on a straight line, so we can interpolate between any two truss structures by applying the spherical linear interpolation88 (slerp)

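$$\mathrm{SLERP}(z_1, z_2; \alpha) = \frac{\sin\left((1-\alpha)\theta\right)}{\sin\theta}\, z_1 + \frac{\sin(\alpha\theta)}{\sin\theta}\, z_2, \qquad (6)$$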

where z1 and z2 are the vectors of two points in the latent space, α ∈ [0, 1] is the interpolation parameter, and z1 ⋅ z2 = cos θ. A detailed discussion on slerp and its comparison with linear interpolation is provided in Supplementary Note 3.2. Figure 4 presents two examples of interpolating between two truss structures that have significantly different mechanical behavior (e.g., structures with the largest and smallest Young’s modulus E11, and with the largest and smallest universal anisotropy index AU, which quantifies the degree of structural anisotropy89 (details are provided in Methods)). Selected structures generated by the decoder along the interpolation path are visualized along with their respective elastic surfaces obtained from FE homogenization (see also Supplementary Movie 4). Our smooth and continuous latent space ensures that, while the start- and end-point structures have opposite extremes of mechanical properties among the dataset, the transition of the structural geometries is smooth. This provides new opportunities for the design of continuous families of truss structures with property grading, which bypasses complex optimization algorithms operating in the high-dimensional, discrete design space. In the second example of Fig. 4 (Supplementary Movie 5), we observed that, along the interpolation path between two points with extreme anisotropy values AU, new structures are generated that have a considerably higher AU-value than all trusses in the training dataset. This is possible, as we interpolate in the latent space instead of the property space. The jointly trained property predictor encourages structures with similar mechanical performance to be located in the same region in latent space. Therefore, sampling in the vicinity of a point in latent space results in a rich family of trusses with similar properties (see also Supplementary Note 3.1). 
Moreover, new structures generated along a smooth interpolation path, or in the vicinity of the extreme values in the dataset, are expected to exhibit continuous property changes, including unprecedented extreme values.
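The slerp formula of Eq. (6) can be sketched as follows. This is an illustrative NumPy version, not the authors' code: normalizing the dot product by the vector norms (so that the formula also works for latent vectors that are not of unit length) and the fallback to linear interpolation for (anti)parallel vectors are assumptions added here.

```python
import numpy as np

def slerp(z1, z2, alpha):
    """Spherical linear interpolation between latent vectors z1 and z2."""
    z1 = np.asarray(z1, dtype=float)
    z2 = np.asarray(z2, dtype=float)
    # Angle between the two vectors (normalization is an added assumption).
    cos_theta = np.dot(z1, z2) / (np.linalg.norm(z1) * np.linalg.norm(z2))
    theta = np.arccos(np.clip(cos_theta, -1.0, 1.0))
    if np.isclose(np.sin(theta), 0.0):
        # (Anti)parallel vectors: the formula is singular, so interpolate linearly.
        return (1.0 - alpha) * z1 + alpha * z2
    w1 = np.sin((1.0 - alpha) * theta) / np.sin(theta)
    w2 = np.sin(alpha * theta) / np.sin(theta)
    return w1 * z1 + w2 * z2

z_a = np.array([1.0, 0.0])
z_b = np.array([0.0, 1.0])
z_mid = slerp(z_a, z_b, 0.5)  # halfway point stays on the unit circle
```

In contrast to linear interpolation, the midpoint keeps the norm of the end points, which is the motivation for interpolating along the hypersphere.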

Fig. 4. Representative examples of interpolation in the latent space.


Samples are generated by interpolation between two points in latent space, whose corresponding trusses exhibit extreme mechanical properties (in terms of directional Young’s modulus E11 and the universal anisotropy index AU). Their corresponding 3D elastic surface evolution (obtained by finite element homogenization) is shown along the interpolation path, which indicates the effective directional Young’s modulus E(d) normalized by the Young’s modulus Es of base material.

Gradient-based optimization in the latent space

The continuous latent space successfully captures the underlying mechanical features of trusses rather than simply memorizing the training data. This enables the use of gradient-based optimization techniques to guide the tailoring of truss lattices to achieve desired properties and, furthermore, to extrapolate beyond the training domain. While the forward mapping from structure to property is straightforward, the inverse design problem is ill-posed due to the one-to-many mapping from the property space to the geometry space (i.e., multiple different truss candidates may lead to the same effective properties). This can be overcome by searching for a potential structure candidate, whose reconstructed stiffness matches the queried stiffness44. To generate physically realistic truss structures or to obtain the sensitivities of properties with respect to structural features, we leverage automatic differentiation and use a backpropagation algorithm to obtain the gradients through the VAE model and the property predictor. Due to the discrete nature of truss topologies, unconstrained optimization in the latent space can be problematic and may result in invalid structures even with the variational term as a regularization, since there is no explicit constraint on the validity of generated samples when searching the whole latent space. As a remedy, we adopt an indirect approach by first reconstructing truss structures from their latent representation given by the optimizer, and passing them to the encoder to obtain the actual latent variables, which are then forwarded to the property predictor to predict the effective stiffness of generated structures (see Fig. 2). The additional encoding-decoding process ensures that candidate structures proposed by the optimizer are valid.
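The indirect evaluation described above (decode, re-encode, then predict) can be illustrated with a toy loop. Everything in this sketch is a stand-in: `decode`, `encode`, and `predict` are hypothetical placeholder functions rather than the trained VAE and property predictor, and the gradient is obtained by central finite differences instead of the automatic differentiation used in the paper.

```python
import numpy as np

def numerical_gradient(f, z, eps=1e-6):
    """Central finite-difference gradient of the scalar function f at z."""
    grad = np.zeros_like(z)
    for i in range(z.size):
        step = np.zeros_like(z)
        step[i] = eps
        grad[i] = (f(z + step) - f(z - step)) / (2.0 * eps)
    return grad

def optimize_latent(z_init, decode, encode, predict, lr=0.1, n_steps=200):
    """Gradient ascent on predict(encode(decode(z)))."""
    z = np.array(z_init, dtype=float)

    def objective(v):
        # The property is evaluated on the re-encoded reconstruction.
        return predict(encode(decode(v)))

    for _ in range(n_steps):
        z = z + lr * numerical_gradient(objective, z)
    z_valid = encode(decode(z))  # latent code of a decodable structure
    return z_valid, predict(z_valid)

# Toy stand-ins (hypothetical, NOT the trained networks).
target = np.array([0.5, -0.3])
decode = lambda z: np.tanh(z)                    # "structure" confined to (-1, 1)
encode = lambda x: x                             # "latent code" of that structure
predict = lambda z: -np.sum((z - target) ** 2)   # score to be maximized

z_opt, value = optimize_latent(np.zeros(2), decode, encode, predict)
```

The point of the design is that the optimizer never scores a raw latent vector directly; it always scores the latent code of something the decoder can actually produce.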

To demonstrate the inverse design capabilities, we apply our generative modeling framework to design truss structures with extreme mechanical properties. Considering the one-to-many mapping of properties to structures and to have a reference, we first evaluate all structures in the training dataset towards the target property. Based on that data, initial guesses are chosen as the 100 closest matches in terms of the target property. Gradient-based optimizations for each initial guess are performed in parallel, and the best solution is identified by examining the FE-reconstructed properties of the 100 optimal solutions (the property predictor only serves as a computational shortcut to obtain the effective response during optimization). This approach identifies multiple candidate truss structures, which exhibit similar mechanical behavior—allowing for the selection of optimal structures under consideration of additional target attributes such as manufacturability or further properties of interest (see the detailed discussion in Supplementary Note 3.3).
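The multi-start strategy can be sketched as two small selection steps. The snippet is illustrative only: the property values, the candidate labels, and the helper names `closest_matches` and `select_best` are hypothetical, the target is assumed to be a single scalar, and the optimization and FE verification that happen between the two steps are not shown.

```python
import numpy as np

def closest_matches(properties, target, k):
    """Indices of the k dataset entries whose property is closest to the target."""
    distance = np.abs(np.asarray(properties, dtype=float) - target)
    return np.argsort(distance)[:k]

def select_best(candidates, verified_values, target):
    """Pick the candidate whose verified (e.g., FE-computed) property is closest."""
    distance = np.abs(np.asarray(verified_values, dtype=float) - target)
    best = int(np.argmin(distance))
    return candidates[best], verified_values[best]

# Hypothetical property values of six dataset entries and a target beyond them.
dataset_properties = np.array([0.010, 0.055, 0.030, 0.068, 0.042, 0.061])
target_value = 0.10
start_ids = closest_matches(dataset_properties, target_value, k=3)

# After optimizing each start independently, the candidates are re-evaluated
# (here with made-up numbers) and the best one is kept.
candidates = ["truss_3", "truss_5", "truss_1"]
verified_values = [0.097, 0.104, 0.081]
best_id, best_value = select_best(candidates, verified_values, target_value)
```

Keeping all optimized candidates, rather than only the best one, is what allows a later choice based on additional criteria such as manufacturability.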

Figure 5 illustrates three examples of the optimal design of truss lattices towards extreme properties. In all three cases, the trained generative models produce robust designs with properties far outside of the training domain through careful tuning of architectures, thus greatly expanding the limits of the property range. The first example maximizes the directional Young’s modulus E22, for which Fig. 5a shows the optimization path in the property space. Intermediate truss lattices are visualized to demonstrate the effect of the structural evolution (including the Voigt upper bounds EVoigt = Es ⋅ ρ and GVoigt = Gs ⋅ ρ). The optimization scheme gradually adjusts the beam arrangements along the e2-direction, exceeding the maximum Young’s modulus in the training dataset (E22,max = 0.068) by 51.5%. The second example in Fig. 5b shows structures optimized for a maximum auxetic behavior (i.e., for a maximally negative Poisson’s ratio ν21) in the e1-e2-plane. The optimization scheme reaches an optimal structure with ν21 = −2.711, which is a 42.9% improvement over the most negative Poisson’s ratio in the training set (ν21,min = −1.897). Finally, Fig. 5c illustrates the search for near-pentamode structures90, i.e., for fluid-like trusses with a high bulk-to-shear modulus ratio, which are soft in shear but (close to) incompressible. The gradient optimization scheme here maximizes the ratio of the bulk modulus to the shear modulus. (Since the structures are anisotropic, we use the Voigt average bulk and shear moduli91, KV and GV, respectively, for optimization.) Results show how the optimal structure yields a ratio of KV/GV that is 28.6% higher than the maximum value contained in the training dataset (which is 14). 
While this may not be an impressive improvement compared to existing pentamode designs, we stress that—in all three optimization examples—the generative model improved the target properties significantly by only small structural modifications (compare, e.g., the initial and final structures in Fig. 5b and c). Such small changes, which are unlikely to be found by design intuition and experience, demonstrate the complexity of the design and property spaces and highlight the benefits of our approach.
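The reported improvements can be checked with simple arithmetic. In the sketch below, only the Poisson's ratio case uses two values quoted in the text; for the Young's modulus and the bulk-to-shear ratio the text gives the training maximum and the relative gain, so the optimized values computed here are implied rather than quoted. The helper `relative_gain` is a hypothetical name.

```python
def relative_gain(optimized, baseline):
    """Relative improvement in magnitude of the optimized value over the baseline."""
    return abs(optimized) / abs(baseline) - 1.0

# Poisson's ratio: optimized and training-set extreme are both given in the text.
gain_nu = relative_gain(-2.711, -1.897)

# Young's modulus and K_V/G_V: implied optimized values from baseline and gain.
implied_E22 = 0.068 * (1.0 + 0.515)
implied_KG = 14.0 * (1.0 + 0.286)

print(f"gain in |nu21|: {100.0 * gain_nu:.1f}%")
print(f"implied E22: {implied_E22:.3f}, implied K_V/G_V: {implied_KG:.1f}")
```

The first number reproduces the stated 42.9%, and the implied optimized values are roughly 0.103 for E22 and 18 for the bulk-to-shear ratio.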

Fig. 5. Inverse-designed truss metamaterials based on gradient optimization.


a Maximizing Young’s modulus E22, b minimizing Poisson’s ratio ν21, and c maximizing the bulk-to-shear modulus ratio KV/GV of truss lattices. Each example shows the property evolution vs. the number of optimization iteration steps, including a few selected structures at the indicated points, as well as the property path compared to the training dataset in the relevant property spaces (each dot represents a truss in the training data, which is color-coded by the (dimensionless) radius of the beams with circular cross-section). Source data are provided as a Source Data file.

Design for tailored nonlinear mechanical response

In addition to target properties in the linear regime, we further verify the efficiency and generalization ability of our framework by the inverse design of nonlinear mechanical metamaterials. We consider a subset of the training dataset that contains 383,729 unique structures, striking a balance between computational cost and model performance. To characterize the effective behavior of truss structures, we homogenize the stress–strain response of the truss unit cells with periodic boundary conditions under uniaxial compression up to a compressive strain of 25% in the z-direction. The established truss database and their corresponding nonlinear stress–strain responses are used to train the generative modeling framework with the objective of enabling the design of novel metamaterials with desired nonlinear responses. To facilitate training of the ML model, we reduce the dimensionality of the learning labels and describe the stress–strain curve by a vector σt = [σ(0.5%), σ(2.5%), ..., σ(24.5%)]^T, which contains the compressive stress values at 13 equally spaced strain points along the range of applied compressive strains. Figure 6a shows the comparison between the predicted vs. true stress–strain curves for four representative examples from the test dataset, exhibiting distinct nonlinear behaviors. The trained property predictor accurately predicts the nonlinear responses for unseen truss structures, achieving an overall normalized root mean square error (NRMSE) of 4.5%, which confirms that the property predictor provides an effective estimate of the nonlinear responses of diverse trusses.
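The label construction and the error measure can be sketched as follows. The 13 strain levels follow the text; the linear interpolation used to sample a curve, the example curve itself, and in particular the normalization of the RMSE by the range of the true values are assumptions, since the text does not state which NRMSE convention is used.

```python
import numpy as np

# 13 equally spaced strain levels from 0.5% to 24.5% (spacing of 2%).
strain_points = np.linspace(0.005, 0.245, 13)

def sample_curve(strain, stress, points=strain_points):
    """Linearly interpolate a stress-strain curve at the label strain levels."""
    return np.interp(points, strain, stress)

def nrmse(predicted, true):
    """RMSE normalized by the range of the true values (assumed convention)."""
    predicted = np.asarray(predicted, dtype=float)
    true = np.asarray(true, dtype=float)
    rmse = np.sqrt(np.mean((predicted - true) ** 2))
    return rmse / (true.max() - true.min())

# Hypothetical softening-type curve, used only to exercise the functions.
strain = np.linspace(0.0, 0.25, 251)
stress = 1.0 - np.exp(-20.0 * strain)
sigma_t = sample_curve(strain, stress)   # 13-component label vector
error = nrmse(1.05 * sigma_t, sigma_t)   # error of a 5% overprediction
```

Reducing each curve to a fixed-length vector keeps the output dimension of the property predictor small and identical for all trusses.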

Fig. 6. Generative design of truss metamaterials with target nonlinear response.


a Comparison between the stress–strain curves predicted by the property predictor (solid dots) and the ground truth (solid lines) for four representative truss structures. Each of the four examples shows the corresponding truss unit cell and its (dimensionless) displacement magnitude under an applied compressive vertical strain ε33 = 25%. b Inverse-designed truss structures obtained from gradient optimization with target nonlinear responses. Each example shows the comparison of responses between the best match from the training dataset and the optimized solution as well as the corresponding truss unit cells. All shown true stress–strain curves were obtained by finite element homogenization.

Next, we demonstrate the inverse design capabilities of the model by applying our generative modeling framework to design truss structures matching a given target stress–strain response. Figure 6b shows two benchmark examples of different stress–strain response targets. First, we select a design target that exceeds the stiffest response in the considered training dataset by 30%. As shown in Fig. 6b (i), the optimal truss structure closely matches the target response with an NRMSE of 3.8%, which showcases the capability of the framework to design truss structures that exhibit specific desired responses, even beyond the range of observed behaviors in the training dataset. Second, we consider a target stress–strain curve displaying pronounced softening behavior, with a minimum NRMSE between the top pick within the training dataset and the target of 9.1%, as illustrated in Fig. 6b (ii). The generated optimal design significantly outperforms the best match in the training dataset for the considered target (with an NRMSE of 3.0%). This demonstrates that the constructed continuous latent space effectively captures the essential features and (some of the) underlying physics of periodic trusses, which enables the inverse design of novel trusses that closely match unseen responses in both the linear and nonlinear regimes, clearly surpassing the limits of the training dataset.

Discussion

The presented generative modeling framework constructs a continuous, low-dimensional latent space of truss metamaterials. By analogy with molecules, we leverage the graph representation to interpret periodic trusses as graphs, thus providing an efficient, consistent, and general parameterization, which covers a wide range of truss structures and a tremendous space of anisotropic mechanical properties. Encoding the information related to the truss connectivity, the node positions, and their shared information in different dimensions of the latent representation enables a human interpretation of the otherwise intractable latent space. It also provides flexibility and tunability in manipulating structural features of truss lattices to achieve optimal properties. A major advantage of the unified and continuous latent representation is that novel truss structures can be conveniently generated by simple operations in the latent space, including sampling in the vicinity of known data points, traversing along the latent axes, and interpolating between two points. While classical VAEs often suffer from the issue of opacity and a lack of a physically meaningful representation, the proposed framework tackles this challenge by jointly training a neural network that predicts the truss properties from the latent space, which allows us to creatively navigate the latent space and to extrapolate with gradient-based optimization techniques to unseen, extreme properties outside the original training domain. This admits identifying optimal lightweight truss lattices with target combinations of, e.g., the elastic constants in 3D and nonlinear stress–strain responses. The proposed design framework admits extension to other properties of truss metamaterials45,92,93 by modifying the property predictor as well as to other types of metamaterials37,44,94–97 by modifying the design parameterization. 
This highlights the potential of our framework to analyze and optimize a broad range of metamaterials. The physical interpretability and extrapolation ability open up new avenues for the discovery of new metamaterials and lend inspiration for designing cellular structures with tailored properties by tuning the architectural features.

Methods

Data generation

Supplementary Fig. 1 shows the details of the generation of a diverse truss lattice dataset. We define the truss graph in the octant within a domain Ω0 = [0, 1]³ ⊂ ℝ³, which is populated into truss structures in the domain Ω = [−1, 1]³ ⊂ ℝ³ through reflections about the three mutually orthogonal symmetry planes. Starting from the five elementary truss lattices shown in Step 1 of Supplementary Fig. 1, new structures are created by randomly perturbing both the node positions and connectivities for several iterations. Supplementary Fig. 2 shows the node positions and connectivities of the five considered elementary truss structures. Node positions are altered by offsets defined in the natural coordinate system66 and sampled from the uniform distribution λ ~ U(−0.5, 0.5). Based on the initial truss structures, new connectivities may be introduced by removing available nodes or inserting new nodes with the following constraints: (1) the established structure is a single connected component; (2) all beams are shorter than rmax = √3/2 (times the unit side length of the unit cell), where the maximum permissible length of a beam connection rmax is chosen according to the longest connection in the initial five elementary trusses; (3) no dangling connections exist within the structure (every node has at least two connected beams). Each geometry is perturbed for 10 iterations to generate a library that contains a wide range of truss structures with several unique topologies. From the established set, we randomly sample two lattices with repetitions allowed, which are then superimposed according to their matching nodes to yield a more diverse dataset. The full dataset considered for training the generative models contains 965,736 lattices and their homogenized effective stiffness properties.
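The three constraints can be checked with a short graph routine. This is a minimal sketch under stated assumptions, not the authors' generator: the function name `is_valid_truss` is hypothetical, `max_length` is taken in the same units as the node coordinates, beams exactly as long as the bound are accepted, and the check is applied to a single node/edge list without accounting for the reflections into the full unit cell.

```python
import numpy as np
from collections import deque

def is_valid_truss(nodes, edges, max_length):
    """Check connectivity, beam length, and dangling-node constraints."""
    nodes = np.asarray(nodes, dtype=float)
    n = len(nodes)
    neighbors = {i: set() for i in range(n)}
    for i, j in edges:
        neighbors[i].add(j)
        neighbors[j].add(i)

    # (2) no beam is longer than max_length (small tolerance for round-off).
    for i, j in edges:
        if np.linalg.norm(nodes[i] - nodes[j]) > max_length + 1e-12:
            return False

    # (3) no dangling connections: every node joins at least two beams.
    if any(len(neighbors[i]) < 2 for i in range(n)):
        return False

    # (1) single connected component, via breadth-first search from node 0.
    seen, queue = {0}, deque([0])
    while queue:
        i = queue.popleft()
        for j in neighbors[i] - seen:
            seen.add(j)
            queue.append(j)
    return len(seen) == n

# Hypothetical example: a triangle of beams in the plane z = 0.
triangle_nodes = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
triangle_edges = [(0, 1), (1, 2), (0, 2)]
ok = is_valid_truss(triangle_nodes, triangle_edges, max_length=3 ** 0.5)
```

A candidate produced by a random perturbation would be kept only if such a check passes; removing one beam of the triangle, for instance, leaves two nodes with a single beam each and fails constraint (3).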

Computational homogenization

The effective stiffness tensor C of all truss structures in our library is computed by FE homogenization with periodic boundary conditions75, using an in-house C++ FE code (available at http://ae108.ethz.ch). Each strut in the truss unit cell is modeled as a linear elastic Timoshenko beam with a circular cross-section. The strut radius of each unit cell is scaled to maintain a constant relative density of ρ = 0.15. We visualize the 3D anisotropic stiffness of truss lattices as elastic surfaces, which indicate the effective directional Young’s modulus E(d) for all directions d ∈ S² as

$$E(\mathbf{d}) = \left[\sum_{i,j,k,l=1}^{3} C^{-1}_{ijkl}\, d_i d_j d_k d_l\right]^{-1}. \qquad (7)$$
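Eq. (7) can be evaluated directly from a 6×6 stiffness matrix. The sketch below assumes the standard Voigt convention with engineering shear strains and the index order 11, 22, 33, 23, 13, 12, in which the fourth-order contraction reduces to a quadratic form of the compliance matrix; the function names and the cubic example constants are hypothetical.

```python
import numpy as np

def directional_youngs_modulus(C_voigt, d):
    """E(d) = 1 / (S_ijkl d_i d_j d_k d_l) for a 6x6 Voigt stiffness matrix."""
    d = np.asarray(d, dtype=float)
    d = d / np.linalg.norm(d)        # direction on the unit sphere
    S = np.linalg.inv(C_voigt)       # compliance matrix (engineering shear strains)
    # Voigt form of the dyad d (x) d, ordered 11, 22, 33, 23, 13, 12.
    v = np.array([d[0] ** 2, d[1] ** 2, d[2] ** 2,
                  d[1] * d[2], d[0] * d[2], d[0] * d[1]])
    return 1.0 / (v @ S @ v)

def cubic_stiffness(c11, c12, c44):
    """6x6 Voigt stiffness matrix of a material with cubic symmetry."""
    C = np.zeros((6, 6))
    C[:3, :3] = c12
    C[np.arange(3), np.arange(3)] = c11
    C[np.arange(3, 6), np.arange(3, 6)] = c44
    return C

C_example = cubic_stiffness(2.0, 1.0, 2.0)   # hypothetical constants
E_100 = directional_youngs_modulus(C_example, [1, 0, 0])
E_111 = directional_youngs_modulus(C_example, [1, 1, 1])
```

Sampling d over the unit sphere and plotting E(d) d yields an elastic surface of the kind shown in Figs. 3 and 4; for this example the modulus along the body diagonal is three times that along a cube axis.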

In Figs. 3 and 4, we presented representative samples of novel truss lattices generated by interpolating between known structures that exhibit extreme universal anisotropy values AU89. AU can be interpreted as a generalization of the Zener index98 (which applies to structures with cubic symmetry) and can be expressed as

$$A^U = 5\,\frac{G_V}{G_R} + \frac{K_V}{K_R} - 6, \qquad (8)$$

where GV and KV are the Voigt estimates for, respectively, the shear and bulk moduli91, and GR and KR are the Reuss estimates for, respectively, the shear and bulk moduli99. Specifically, Reuss proposed the following relations for the bulk modulus K and shear modulus G in terms of the compliance components Sijkl:

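$$K_R^{-1} = (S_{1111}+S_{2222}+S_{3333}) + 2\,(S_{1122}+S_{1133}+S_{2233}), \qquad (9)$$
$$15\,G_R^{-1} = 4\,(S_{1111}+S_{2222}+S_{3333}) - 4\,(S_{1122}+S_{1133}+S_{2233}) + 3\,(S_{4444}+S_{5555}+S_{6666}). \qquad (10)$$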

Analogously, we adopt the Voigt average bulk and shear moduli, calculated from the anisotropic stiffness components Cijkl as, respectively,

$$9\,K_V = (C_{1111}+C_{2222}+C_{3333}) + 2\,(C_{1122}+C_{1133}+C_{2233}), \qquad (11)$$
$$15\,G_V = (C_{1111}+C_{2222}+C_{3333}) - (C_{1122}+C_{1133}+C_{2233}) + 3\,(C_{4444}+C_{5555}+C_{6666}). \qquad (12)$$
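Eqs. (8)–(12) translate into a few lines of NumPy. The sketch assumes that the stiffness is given as a 6×6 Voigt matrix with engineering shear strains, so that the components written with indices 44, 55, and 66 are the diagonal shear entries of that matrix and of its inverse; the function names and the example constants are hypothetical.

```python
import numpy as np

def voigt_reuss_moduli(C):
    """Voigt and Reuss bulk/shear moduli from a 6x6 Voigt stiffness matrix."""
    C = np.asarray(C, dtype=float)
    S = np.linalg.inv(C)  # compliance matrix

    def sums(M):
        normal = M[0, 0] + M[1, 1] + M[2, 2]
        coupling = M[0, 1] + M[0, 2] + M[1, 2]
        shear = M[3, 3] + M[4, 4] + M[5, 5]
        return normal, coupling, shear

    c_n, c_c, c_s = sums(C)
    s_n, s_c, s_s = sums(S)
    K_V = (c_n + 2.0 * c_c) / 9.0                            # Eq. (11)
    G_V = (c_n - c_c + 3.0 * c_s) / 15.0                     # Eq. (12)
    K_R = 1.0 / (s_n + 2.0 * s_c)                            # Eq. (9)
    G_R = 15.0 / (4.0 * s_n - 4.0 * s_c + 3.0 * s_s)         # Eq. (10)
    return K_V, G_V, K_R, G_R

def universal_anisotropy_index(C):
    """A^U = 5 G_V / G_R + K_V / K_R - 6, Eq. (8)."""
    K_V, G_V, K_R, G_R = voigt_reuss_moduli(C)
    return 5.0 * G_V / G_R + K_V / K_R - 6.0

def cubic_stiffness(c11, c12, c44):
    """6x6 Voigt stiffness matrix of a material with cubic symmetry."""
    C = np.zeros((6, 6))
    C[:3, :3] = c12
    C[np.arange(3), np.arange(3)] = c11
    C[np.arange(3, 6), np.arange(3, 6)] = c44
    return C

A_iso = universal_anisotropy_index(cubic_stiffness(3.0, 1.0, 1.0))    # isotropic
A_cubic = universal_anisotropy_index(cubic_stiffness(2.0, 1.0, 2.0))  # Zener ratio 4
```

For an isotropic stiffness the Voigt and Reuss estimates coincide and the index vanishes; any departure from isotropy makes it positive.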

ML framework

Details of the optimized dimensions and hyperparameters (e.g., the number of hidden layers and nodes in each layer, activation functions, learning rates, etc.) of the VAE model and the property predictor are provided in Supplementary Table 1. Thresholding is applied at the end of the connectivity decoder by a sigmoid function to achieve a binary connectivity matrix. One percent of the generated dataset is used for the tuning and optimization of hyperparameters. We used the PyTorch100 package throughout the implementation of the proposed generative learning framework and leveraged its automatic differentiation engine, autograd, to automatically obtain the gradients of the homogenized effective properties with respect to the structural and geometrical features towards the optimization and inverse design of truss lattices. To remedy the issue of the vanishing KL-divergence term101, we schedule the weight of the KL-divergence term β via the cyclical schedule86 shown in Supplementary Fig. 3. The training process is split into several cycles, and in each cycle β is gradually increased from 0 to 1, using a linear function after 50 epochs. For a detailed performance comparison of various annealing schemes for β, see refs. 86,101,102.
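A cyclical linear schedule for β can be sketched as below. This shows the general idea of cyclical annealing only: the cycle length, the fraction of each cycle spent ramping, and the function name `cyclical_beta` are hypothetical placeholders, and the schedule actually used is the one shown in Supplementary Fig. 3.

```python
def cyclical_beta(epoch, cycle_length=100, ramp_fraction=0.5):
    """KL weight for a cyclical schedule: linear ramp from 0 to 1, then hold at 1."""
    position = (epoch % cycle_length) / cycle_length   # position within cycle, [0, 1)
    return min(1.0, position / ramp_fraction)

# Two full cycles with the placeholder settings above.
schedule = [cyclical_beta(epoch) for epoch in range(200)]
```

Resetting β to zero at the start of every cycle lets the decoder periodically rely on the latent code again, which is the mechanism by which cyclical annealing counteracts a vanishing KL term.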

Details of the data generation (Section 1), the ML protocols (Section 2.1), the implementation of the overlapping embedding model (Section 2.2), the NN model performance (Section 2.3), exploration in the latent space including sampling (Section 3.1) and interpolation (Section 3.2), details on the gradient-based optimization in the latent space (Section 3.3), details on the inverse design of truss metamaterials with target nonlinear responses (Section 3.4), and the computational efficiency estimates (Section 4) are summarized in the Supplementary Information.

Supplementary information

Peer Review File (27.5MB, pdf)
41467_2023_42068_MOESM3_ESM.pdf (48.3KB, pdf)

Description of Additional Supplementary Files

Supplementary Movie 1 (9.8MB, mov)
Supplementary Movie 2 (14.7MB, mov)
Supplementary Movie S (16.5MB, mov)
Supplementary Movie 5 (10.4MB, mov)

Acknowledgements

This research received financial support from Adidas as well as from ETH Zurich through the ETH+ grant SynMatLab. K.K. acknowledges the support from a Marie-Sklodowska Curie Postdoctoral Fellowship under Grant Agreement No. 101024077. The authors gratefully acknowledge the support from Adidas and the discussions with Dr. Ladan Salari-Sharif and Derek Luther.

Author contributions

L.Z.: Methodology, Software, Validation, Data Curation, Visualization, Writing—Original Draft; K.K.: Software, Data Curation; S.K.: Conceptualization, Methodology, Writing—Review & Editing, Supervision; D.M.K.: Conceptualization, Methodology, Writing—Review & Editing, Supervision.

Peer review

Peer review information

Nature Communications thanks Angkur Jyoti Dipanka Shaikeea, Jiayi Yan, Yao Chen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Data availability

The training data including truss structures and their effective homogenized properties generated in this study have been deposited in the ETHZ Research Collection103. Source data are provided with this paper.

Code availability

The code used to train the generative modeling framework and obtain inverse designs of truss structures has been uploaded to Github104. The FE code used for homogenization in this study is available in the ae108 library105.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Siddhant Kumar, Email: sid.kumar@tudelft.nl.

Dennis M. Kochmann, Email: dmk@ethz.ch

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-023-42068-x.

References

  • 1. Nicolaou ZG, Motter AE. Mechanical metamaterials with negative compressibility transitions. Nat. Mater. 2012;11:608–613. doi: 10.1038/nmat3331.
  • 2. Qu J, Gerber A, Mayer F, Kadic M, Wegener M. Experiments on metamaterials with negative effective static compressibility. Phys. Rev. X. 2017;7:041060.
  • 3. Babaee S, et al. 3D soft metamaterials with negative Poisson’s ratio. Adv. Mater. 2013;25:5044–5049. doi: 10.1002/adma.201301986.
  • 4. Yasuda H, Yang J. Reentrant origami-based metamaterials with negative Poisson’s ratio and bistability. Phys. Rev. Lett. 2015;114:185502. doi: 10.1103/PhysRevLett.114.185502.
  • 5. Bückmann T, Thiel M, Kadic M, Schittny R, Wegener M. An elasto-mechanical unfeelability cloak made of pentamode metamaterials. Nat. Commun. 2014;5:1–6. doi: 10.1038/ncomms5130.
  • 6. Yuan S, Chua CK, Zhou K. 3D-printed mechanical metamaterials with high energy absorption. Adv. Mater. Technol. 2019;4:1800419. doi: 10.1002/admt.201800419.
  • 7. Kumar S, Ubaid J, Abishera R, Schiffer A, Deshpande V. Tunable energy absorption characteristics of architected honeycombs enabled via additive manufacturing. ACS Appl. Mater. Interfaces. 2019;11:42549–42560. doi: 10.1021/acsami.9b12880.
  • 8. Guell Izard A, Bauer J, Crook C, Turlo V, Valdevit L. Ultrahigh energy absorption multifunctional spinodal nanoarchitectures. Small. 2019;15:1903834. doi: 10.1002/smll.201903834.
  • 9. Chen Y, Li T, Scarpa F, Wang L. Lattice metamaterials with mechanically tunable Poisson’s ratio for vibration control. Phys. Rev. Appl. 2017;7:024012. doi: 10.1103/PhysRevApplied.7.024012.
  • 10. Li Y, Baker E, Reissman T, Sun C, Liu WK. Design of mechanical metamaterials for simultaneous vibration isolation and energy harvesting. Appl. Phys. Lett. 2017;111:251903. doi: 10.1063/1.5008674.
  • 11. Evans AG, Hutchinson JW, Ashby MF. Multifunctionality of cellular metal systems. Prog. Mater. Sci. 1998;43:171–221. doi: 10.1016/S0079-6425(98)00004-8.
  • 12. Schaedler TA, et al. Ultralight metallic microlattices. Science. 2011;334:962–965. doi: 10.1126/science.1211649.
  • 13. Wallach J, Gibson L. Mechanical behavior of a three-dimensional truss material. Int. J. Solids Struct. 2001;38:7181–7196. doi: 10.1016/S0020-7683(00)00400-5.
  • 14. Deshpande VS, Fleck NA, Ashby MF. Effective properties of the octet-truss lattice material. J. Mech. Phys. Solids. 2001;49:1747–1769. doi: 10.1016/S0022-5096(01)00010-2.
  • 15. Meza LR, et al. Reexamining the mechanical property space of three-dimensional lattice architectures. Acta Mater. 2017;140:424–432. doi: 10.1016/j.actamat.2017.08.052.
  • 16. Portela CM, Greer JR, Kochmann DM. Impact of node geometry on the effective stiffness of non-slender three-dimensional truss lattice architectures. Extreme Mech. Lett. 2018;22:138–148. doi: 10.1016/j.eml.2018.06.004.
  • 17. Sharpe C, Seepersad CC, Watts S, Tortorelli D. Design of mechanical metamaterials via constrained Bayesian optimization. In: International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, vol. 51753, V02AT03A029. American Society of Mechanical Engineers; 2018.
  • 18. Tran A, Tran M, Wang Y. Constrained mixed-integer Gaussian mixture Bayesian optimization and its applications in designing fractal and auxetic metamaterials. Struct. Multidiscip. Optim. 2019;59:2131–2154. doi: 10.1007/s00158-018-2182-1.
  • 19. Watts S, Arrighi W, Kudo J, Tortorelli DA, White DA. Simple, accurate surrogate models of the elastic response of three-dimensional open truss micro-architectures with applications to multiscale topology design. Struct. Multidiscip. Optim. 2019;60:1887–1920. doi: 10.1007/s00158-019-02297-5.
  • 20. Feng J, Liu B, Lin Z, Fu J. Isotropic octet-truss lattice structure design and anisotropy control strategies for implant application. Mater. Des. 2021;203:109595. doi: 10.1016/j.matdes.2021.109595.
  • 21. Wang L, et al. Deep generative modeling for mechanistic-based learning and design of metamaterial systems. Comput. Methods Appl. Mech. Eng. 2020;372:113377. doi: 10.1016/j.cma.2020.113377.
  • 22. Zheng X, et al. Ultralight, ultrastiff mechanical metamaterials. Science. 2014;344:1373–1377. doi: 10.1126/science.1252291.
  • 23. Schumacher C, et al. Microstructures to control elasticity in 3D printing. ACM Trans. Graph. 2015;34:1–13. doi: 10.1145/2766926.
  • 24. Tancogne-Dejean T, Mohr D. Elastically-isotropic truss lattice materials of reduced plastic anisotropy. Int. J. Solids and Struct. 2018;138:24–39. doi: 10.1016/j.ijsolstr.2017.12.025.
  • 25. Martínez J, et al. Star-shaped metrics for mechanical metamaterial design. ACM Trans. Graph. 2019;38:1–13. doi: 10.1145/3306346.3322989.
  • 26. Jia Z, Liu F, Jiang X, Wang L. Engineering lattice metamaterials for extreme property, programmability, and multifunctionality. J. Appl. Phys. 2020;127:150901. doi: 10.1063/5.0004724.
  • 27. Zhang P, et al. Mechanical design and energy absorption of 3D novel hybrid lattice metamaterials. Sci. China Technol. Sci. 2021;64:2220–2228. doi: 10.1007/s11431-020-1756-x.
  • 28. Zhang Z, Zhang L, Song B, Yao Y, Shi Y. Bamboo-inspired, simulation-guided design and 3D printing of light-weight and high-strength mechanical metamaterials. Appl. Mater. Today. 2022;26:101268. doi: 10.1016/j.apmt.2021.101268.
  • 29. Xu S, Shen J, Zhou S, Huang X, Xie YM. Design of lattice structures with controlled anisotropy. Mater. Des. 2016;93:443–447. doi: 10.1016/j.matdes.2016.01.007.
  • 30. Nguyen J, Park S-i, Rosen D. Heuristic optimization method for cellular structure design of light weight components. Int. J. Precis. Eng. Manuf. 2013;14:1071–1078. doi: 10.1007/s12541-013-0144-5.
  • 31. Alzahrani M, Choi S-K, Rosen DW. Design of truss-like cellular structures using relative density mapping method. Mater. Des. 2015;85:349–360. doi: 10.1016/j.matdes.2015.06.180.
  • 32. Chougrani L, Pernot J-P, Véron P, Abed S. Parts internal structure definition using non-uniform patterned lattice optimization for mass reduction in additive manufacturing. Eng. Comput. 2019;35:277–289. doi: 10.1007/s00366-018-0598-2.
  • 33. Azizi M, Aickelin U, Khorshidi HA, Shishehgarkhaneh MB. Shape and size optimization of truss structures by chaos game optimization considering frequency constraints. J. Adv. Res. 2022;41:89–100. doi: 10.1016/j.jare.2022.01.002.
  • 34. Lumpe TS, Stankovic T. Exploring the property space of periodic cellular structures based on crystal networks. Proc. Natl. Acad. Sci. USA. 2021;118:e2003504118. doi: 10.1073/pnas.2003504118.
  • 35. White DA, Arrighi WJ, Kudo J, Watts SE. Multiscale topology optimization using neural network surrogate models. Comput. Methods Appl. Mech. Eng. 2019;346:1118–1135. doi: 10.1016/j.cma.2018.09.007.
  • 36. Mozaffar M, et al. Deep learning predicts path-dependent plasticity. Proc. Natl. Acad. Sci. USA. 2019;116:26414–26420. doi: 10.1073/pnas.1911815116.
  • 37. Vlassis NN, Ma R, Sun W. Geometric deep learning for computational mechanics part I: anisotropic hyperelasticity. Comput. Methods Appl. Mech. Eng. 2020;371:113299. doi: 10.1016/j.cma.2020.113299.
  • 38. Wu L, Zulueta K, Major Z, Arriaga A, Noels L. Bayesian inference of non-linear multiscale model parameters accelerated by a deep neural network. Comput. Methods Appl. Mech. Eng. 2020;360:112693. doi: 10.1016/j.cma.2019.112693.
  • 39. Fuhg JN, et al. Model-data-driven constitutive responses: application to a multiscale computational framework. Int. J. Eng. Sci. 2021;167:103522. doi: 10.1016/j.ijengsci.2021.103522.
  • 40. Zheng L, Kumar S, Kochmann DM. Data-driven topology optimization of spinodoid metamaterials with seamlessly tunable anisotropy. Comput. Methods Appl. Mech. Eng. 2021;383:113894. doi: 10.1016/j.cma.2021.113894.
  • 41. Ma W, Cheng F, Xu Y, Wen Q, Liu Y. Probabilistic representation and inverse design of metamaterials based on a deep generative model with semi-supervised learning strategy. Adv. Mater. 2019;31:1901111. doi: 10.1002/adma.201901111.
  • 42. Chen C-T, Gu GX. Generative deep neural networks for inverse materials design using backpropagation and active learning. Adv. Sci. 2020;7:1902607. doi: 10.1002/advs.201902607.
  • 43. Xue T, et al. A data-driven computational scheme for the nonlinear mechanical properties of cellular mechanical metamaterials under large deformation. Soft Matter. 2020;16:7524–7534. doi: 10.1039/D0SM00488J.
  • 44. Kumar S, Tan S, Zheng L, Kochmann DM. Inverse-designed spinodoid metamaterials. npj Comput. Mater. 2020;6:1–10. doi: 10.1038/s41524-020-0341-6.
  • 45. Challapalli A, Patel D, Li G. Inverse machine learning framework for optimizing lightweight metamaterials. Mater. Des. 2021;208:109937. doi: 10.1016/j.matdes.2021.109937.
  • 46. Wang Y, Zeng Q, Wang J, Li Y, Fang D. Inverse design of shell-based mechanical metamaterial with customized loading curves based on machine learning and genetic algorithm. Comput. Methods Appl. Mech. Eng. 2022;401:115571. doi: 10.1016/j.cma.2022.115571.
  • 47. Bastek J-H, Kumar S, Telgen B, Glaesener RN, Kochmann DM. Inverting the structure–property map of truss metamaterials by deep learning. Proc. Natl. Acad. Sci. USA. 2022;119:e2111505119. doi: 10.1073/pnas.2111505119.
  • 48. Ma C, et al. Accelerated design and characterization of non-uniform cellular materials via a machine-learning based framework. npj Comput. Mater. 2020;6:40. doi: 10.1038/s41524-020-0309-6.
  • 49. Yamanaka A, Kamijyo R, Koenuma K, Watanabe I, Kuwabara T. Deep neural network approach to estimate biaxial stress–strain curves of sheet metals. Mater. Des. 2020;195:108970. doi: 10.1016/j.matdes.2020.108970.
  • 50.Deng B, et al. Inverse design of mechanical metamaterials with target nonlinear response via a neural accelerated evolution strategy. Adv. Mater. 2022;34:2206238. doi: 10.1002/adma.202206238. [DOI] [PubMed] [Google Scholar]
  • 51.Dong L, Wang J, Wang D. Modeling and design of three-dimensional voxel printed lattice metamaterials. Addit. Manuf. 2023;69:103532. [Google Scholar]
  • 52.Kingma, D. P. & Welling, M. Auto-encoding variational Bayes. arXiv https://arxiv.org/abs/1312.6114 (2013).
  • 53.Goodfellow I, et al. Generative adversarial networks. Commun. ACM. 2020;63:139–144. doi: 10.1145/3422622. [DOI] [Google Scholar]
  • 54.Sanchez-Lengeling B, Aspuru-Guzik A. Inverse molecular design using machine learning: generative models for matter engineering. Science. 2018;361:360–365. doi: 10.1126/science.aat2663. [DOI] [PubMed] [Google Scholar]
  • 55.Liu Z, Zhu D, Rodrigues SP, Lee K-T, Cai W. Generative model for the inverse design of metasurfaces. Nano Lett. 2018;18:6570–6576. doi: 10.1021/acs.nanolett.8b03171. [DOI] [PubMed] [Google Scholar]
  • 56.Gurbuz C, et al. Generative adversarial networks for the design of acoustic metamaterials. J. Acoust. Soc. Am. 2021;149:1162–1174. doi: 10.1121/10.0003501. [DOI] [PubMed] [Google Scholar]
  • 57.Hamilton, W., Ying, Z. & Leskovec, J. Inductive representation learning on large graphs. Adv. Neural Inf. Process. Syst. 30 https://arxiv.org/abs/1706.02216 (2017).
  • 58.Qiu, J. et al. Deepinf: Social influence prediction with deep learning. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, 2110–2119 (2018).
  • 59.Zhang, M. & Chen, Y. Link prediction based on graph neural networks. Adv. Neural Inf. Process. Syst. 31 https://arxiv.org/abs/1802.09691 (2018).
  • 60.Wu Z, et al. Moleculenet: a benchmark for molecular machine learning. Chem. Sci. 2018;9:513–530. doi: 10.1039/C7SC02664A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Jin, W., Barzilay, R. & Jaakkola, T. Junction tree variational autoencoder for molecular graph generation. In: International conference on machine learning, 2323–2332 (PMLR, 2018).
  • 62.Dai M, Demirel MF, Liang Y, Hu J-M. Graph neural networks for an accurate and interpretable prediction of the properties of polycrystalline materials. npj Comput. Mater. 2021;7:1–9. doi: 10.1038/s41524-021-00574-w. [DOI] [Google Scholar]
  • 63.Ross, E. & Hambleton, D. Using graph neural networks to approximate mechanical response on 3D lattice structures. In: Proceedings of AAG2020-Advances in Architectural Geometry 24, 466–485 (2021).
  • 64.Meyer PP, Bonatti C, Tancogne-Dejean T, Mohr D. Graph-based metamaterials: deep learning of structure-property relations. Mater. Des. 2022;223:111175. doi: 10.1016/j.matdes.2022.111175. [DOI] [Google Scholar]
  • 65.Indurkar, P. P., Karlapati, S., Shaikeea, A. J. D. & Deshpande, V. S. Predicting deformation mechanisms in architected metamaterials using GNN. arXiv https://arxiv.org/abs/2202.09427 (2022).
  • 66.Panetta, J. et al. Elastic textures for additive fabrication. ACM Trans. Graph. 34, 135 (2015).
  • 67.Duvenaud, D. K. et al. Convolutional networks on graphs for learning molecular fingerprints. Adv. Neural Inf. Process. Syst. 28 https://arxiv.org/abs/1509.09292 (2015).
  • 68.Simonovsky, M. & Komodakis, N. Graphvae: Towards generation of small graphs using variational autoencoders. In: International conference on artificial neural networks, 412–422 (Springer, 2018).
  • 69.You, J., Ying, R., Ren, X., Hamilton, W. & Leskovec, J. Graphrnn: Generating realistic graphs with deep auto-regressive models. In: International conference on machine learning, 5708–5717 (PMLR, 2018).
  • 70.Vlassis NN, Sun W. Geometric learning for computational mechanics part ii: graph embedding for interpretable multiscale plasticity. Comput. Methods Appl. Mech. Eng. 2023;404:115768. doi: 10.1016/j.cma.2022.115768. [DOI] [Google Scholar]
  • 71.Thomas, N. et al. Tensor field networks: rotation-and translation-equivariant neural networks for 3D point clouds. arXiv https://arxiv.org/abs/1802.08219 (2018).
  • 72.Fuchs F, Worrall D, Fischer V, Welling M. Se (3)-transformers: 3D roto-translation equivariant attention networks. Adv. Neural Inf. Process. Syst. 2020;33:1970–1981. [Google Scholar]
  • 73.Batzner S, et al. E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 2022;13:2453. doi: 10.1038/s41467-022-29939-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Cai, C. et al. Equivariant geometric learning for digital rock physics: estimating formation factor and effective permeability tensors from Morse graph. Int. J. Multiscale Comput. Eng. 21 https://arxiv.org/abs/2104.05608 (2023).
  • 75.Glaesener RN, Träff EA, Telgen B, Canonica RM, Kochmann DM. Continuum representation of nonlinear three-dimensional periodic truss networks by on-the-fly homogenization. Int. J. Solids Struct. 2020;206:101–113. doi: 10.1016/j.ijsolstr.2020.08.013. [DOI] [Google Scholar]
  • 76.Tishby, N., Pereira, F. C. & Bialek, W. The information bottleneck method. arXiv https://arxiv.org/abs/physics/0004057 (2000).
  • 77.Lerique S, Abitbol JL, Karsai M. Joint embedding of structure and features via graph convolutional networks. Appl. Netw. Sci. 2020;5:1–24. doi: 10.1007/s41109-019-0237-x. [DOI] [Google Scholar]
  • 78.Gómez-Bombarelli R, et al. Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent. Sci. 2018;4:268–276. doi: 10.1021/acscentsci.7b00572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.da Silva Ferreira A, Malheiros-Silveira GN, Hernández-Figueroa HE. Computing optical properties of photonic crystals by using multilayer perceptron and extreme learning machine. J. Light. Technol. 2018;36:4066–4073. doi: 10.1109/JLT.2018.2856364. [DOI] [Google Scholar]
  • 80.Liu C-X, Yu G-L. Predicting the dispersion relations of one-dimensional phononic crystals by neural networks. Sci. Rep. 2019;9:1–10. doi: 10.1038/s41598-019-51662-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Maurizi M, Gao C, Berto F. Predicting stress, strain and deformation fields in materials and structures with graph neural networks. Sci. Rep. 2022;12:21834. doi: 10.1038/s41598-022-26424-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Capela, F., Nouchi, V., Van Deursen, R., Tetko, I. V. & Godin, G. Multitask learning on graph neural networks applied to molecular property predictions. arXiv https://arxiv.org/pdf/1910.13124.pdf (2019).
  • 83.Sanyal, S. et al. Mt-cgcnn: integrating crystal graph convolutional neural network with multitask learning for material property prediction. arXiv https://arxiv.org/abs/1811.05660 (2018).
  • 84.Pasini ML, Zhang P, Reeve ST, Choi JY. Multi-task graph neural networks for simultaneous prediction of global and atomic properties in ferromagnetic systems. Mach. Learn.: Sci. Technol. 2022;3:025007. [Google Scholar]
  • 85.Higgins, I. et al. beta-vae: learning basic visual concepts with a constrained variational framework. In: International Conference on Learning Representations (2016).
  • 86.Fu, H. et al. Cyclical annealing schedule: a simple approach to mitigating KL vanishing. arXiv https://arxiv.org/abs/1903.10145 (2019).
  • 87.Deshpande V, Ashby M, Fleck N. Foam topology: bending versus stretching dominated architectures. Acta Mater. 2001;49:1035–1040. doi: 10.1016/S1359-6454(00)00379-7. [DOI] [Google Scholar]
  • 88.Shoemake, K. Animating rotation with quaternion curves. In: Proceedings of the 12th annual conference on Computer graphics and interactive techniques, 245–254 (1985).
  • 89.Ranganathan SI, Ostoja-Starzewski M. Universal elastic anisotropy index. Phys. Rev. Lett. 2008;101:055504. doi: 10.1103/PhysRevLett.101.055504. [DOI] [PubMed] [Google Scholar]
  • 90.Milton GW, Cherkaev A. Which elasticity tensors are realizable. J. Eng. Mater. Technol. 1995;117:483–493. doi: 10.1115/1.2804743. [DOI] [Google Scholar]
  • 91.Voigt, W. Lehrbuch der kristallphysik: (mit ausschluss der kristalloptik), vol. 34 (BG Teubner, 1910).
  • 92.Wilt JK, Yang C, Gu GX. Accelerating auxetic metamaterial design with deep learning. Adv. Eng. Mater. 2020;22:1901266. doi: 10.1002/adem.201901266. [DOI] [Google Scholar]
  • 93.Maurizi M, Gao C, Berto F. Inverse design of truss lattice materials with superior buckling resistance. npj Comput. Mater. 2022;8:247. doi: 10.1038/s41524-022-00938-w. [DOI] [Google Scholar]
  • 94.Yamaguchi K, Yasuda H, Tsujikawa K, Kunimine T, Yang J. Graph-theoretic estimation of reconfigurability in origami-based metamaterials. Mater. Des. 2022;213:110343. doi: 10.1016/j.matdes.2021.110343. [DOI] [Google Scholar]
  • 95.Kollmann HT, Abueidda DW, Koric S, Guleryuz E, Sobh NA. Deep learning for topology optimization of 2d metamaterials. Mater. Des. 2020;196:109098. doi: 10.1016/j.matdes.2020.109098. [DOI] [Google Scholar]
  • 96.Chen Z, Ogren A, Daraio C, Brinson LC, Rudin C. How to see hidden patterns in metamaterials with interpretable machine learning. Extreme Mech. Lett. 2022;57:101895. doi: 10.1016/j.eml.2022.101895. [DOI] [Google Scholar]
  • 97.Alderete NA, Pathak N, Espinosa HD. Machine learning assisted design of shape-programmable 3D kirigami metamaterials. npj Comput. Mater. 2022;8:191. doi: 10.1038/s41524-022-00873-w. [DOI] [Google Scholar]
  • 98.Zener CM, Siegel S. Elasticity and anelasticity of metals. J. Phys. Chem. 1949;53:1468. doi: 10.1021/j150474a017. [DOI] [Google Scholar]
  • 99.Reuß A. Berechnung der fließgrenze von mischkristallen auf grund der plastizitätsbedingung für einkristalle. J. Appl. Math. Mech. 1929;9:49–58. [Google Scholar]
  • 100.Paszke, A. et al. Pytorch: an imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 32 https://arxiv.org/abs/1912.01703 (2019).
  • 101.Bowman, S. R. et al. Generating sentences from a continuous space. arXiv https://arxiv.org/abs/1511.06349 (2015).
  • 102.Shao, H. et al. Controlvae: controllable variational autoencoder. In: International Conference on Machine Learning, 8655–8664 (PMLR, 2020).
  • 103.Zheng, L. Truss dataset. Unifying the design space of truss metamaterials by generative modeling, 10.3929/ethz-b-000618078 (2023-06-23). [DOI] [PMC free article] [PubMed]
  • 104.Zheng, L. Unifying the design space and optimizing linear and nonlinear truss metamaterials by generative modeling. UnifyingTrussDesignSpace, 10.5281/zenodo.8255658 (2023-08-17). [DOI] [PMC free article] [PubMed]
  • 105.Mechanics and Materials Lab. ae108, 10.5905/ethz-1007-257 (2020).

Associated Data


Supplementary Materials

Peer Review File (27.5MB, pdf)
41467_2023_42068_MOESM3_ESM.pdf (48.3KB, pdf)

Description of Additional Supplementary Files

Supplementary Movie 1 (9.8MB, mov)
Supplementary Movie 2 (14.7MB, mov)
Supplementary Movie S (16.5MB, mov)
Supplementary Movie 5 (10.4MB, mov)

Data Availability Statement

The training data generated in this study, including the truss structures and their effective homogenized properties, have been deposited in the ETHZ Research Collection103. Source data are provided with this paper.

The code used to train the generative modeling framework and to obtain inverse designs of truss structures has been uploaded to GitHub104. The FE code used for homogenization in this study is available in the ae108 library105.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group
