arXiv

Efficient Synthetic Network Generation via Latent Embedding Reconstruction

Title: Efficient Synthetic Network Generation via Latent Embedding Reconstruction

Network datasets are prevalent across disciplines ranging from the social sciences and biology to information systems. The ability to generate realistic synthetic network data holds significant promise for applications spanning network simulation to scientific discovery. However, current black-box generation methods often struggle with practical limitations: they tend to overfit observed data, neglect essential structural characteristics, and demand high computational resources as scale increases. To address these issues, there is a pressing need for synthetic generation techniques that are both computationally efficient and capable of accurately capturing network structures.

This paper presents SyNGLER (Synthetic Network Generation via Latent Embedding Reconstruction), a general and efficient framework grounded in latent space network models. The SyNGLER process begins by analyzing an observed network to learn low-dimensional latent node embeddings through a latent space model. Subsequently, it reconstructs the latent space by employing a distribution-free generator based on these embeddings. To generate synthetic data, SyNGLER samples or resamples node embeddings from this generator within the latent space, which are then used by the latent space network model to produce the final synthetic networks.

By leveraging this latent space approach, SyNGLER maintains distinct network features, including sparsity and heterogeneity in node degrees, while offering more efficient training with reduced computational costs compared to many contemporary deep learning architectures. We establish theoretical validity by deriving consistency results regarding the distance between the true edge distributions and those of the synthetic networks. Empirical evaluations confirm SyNGLER’s effectiveness, showing that it efficiently generates networks that better retain key structural attributes—such as degree distributions and network moments—than existing methods. The source code is publicly available at https://github.com/FeifanJiang/syngler.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Law’s Billable Hour Is Being Shredded by AI
Bloomberg

Law’s Billable Hour Is Being Shredded by AI

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026
Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round
Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Amundi Is Diversifying Risk Via Commodity Currencies, Gold
Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Russia Says It Found Foreign Spyware on Top Officials’ Phones
Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...