arXiv

The Impact of Temporal Granularity on Socio-Demographic Inference from Household Load Profiles

Title: How Temporal Resolution Affects Socio-Demographic Profiling via Household Energy Data

Smart meter recordings can expose intimate socio-demographic details about households, raising significant privacy concerns. Although the risk of such inference has been demonstrated at individual temporal resolutions, the effect of temporal resolution itself on inference accuracy has not been thoroughly investigated. This study addresses that gap by examining how temporal granularity, from 15-minute intervals up to weekly aggregates, affects the predictability of eight socio-demographic traits. The analysis uses one year of energy consumption records from 1,589 households.
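To make the notion of temporal granularity concrete, the following minimal sketch sums 15-minute energy readings into coarser windows. The function name `aggregate_load`, the resolution labels, and the synthetic readings are illustrative assumptions, not the study's actual preprocessing.

```python
from typing import Dict, List

# Number of 15-minute readings per aggregation window.
# The four resolutions listed here are an assumption for illustration.
READINGS_PER_WINDOW: Dict[str, int] = {
    "15min": 1,
    "1h": 4,
    "1d": 96,
    "7d": 672,
}


def aggregate_load(readings_kwh: List[float], resolution: str) -> List[float]:
    """Sum consecutive 15-minute energy readings into coarser windows.

    Trailing readings that do not fill a complete window are dropped.
    """
    step = READINGS_PER_WINDOW[resolution]
    n_full = len(readings_kwh) // step
    return [sum(readings_kwh[i * step:(i + 1) * step]) for i in range(n_full)]


# One synthetic week of 15-minute readings, 0.25 kWh each (made-up data).
week = [0.25] * 672
hourly = aggregate_load(week, "1h")
daily = aggregate_load(week, "1d")
weekly = aggregate_load(week, "7d")
```

Each coarser series carries the same total energy in fewer values, which is the sense in which aggregation acts as data minimization.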

To ensure robustness, we developed an evaluation framework in which machine learning classifiers were trained on full-year data but tested on individual, arbitrarily chosen weeks. This setup forces the models to generalize across both seasonal shifts and weekly patterns. The investigation yields three primary conclusions.
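One way such a split might be organized is sketched below. It is a hypothetical reading of the framework, not the authors' protocol: the function `split_year_vs_week`, the 52-week year, the 20% test fraction, and the choice to keep training and test households disjoint are all assumptions.

```python
import random
from typing import Iterable, List, Tuple

Sample = Tuple[int, int]  # (household_id, week_index)


def split_year_vs_week(
    household_ids: Iterable[int],
    n_weeks: int = 52,
    test_fraction: float = 0.2,
    seed: int = 0,
) -> Tuple[List[Sample], List[Sample]]:
    """Build a full-year training set and a single-random-week test set.

    Assumption: households are disjoint between the two sets, so a test
    household is never seen during training.
    """
    rng = random.Random(seed)
    ids = list(household_ids)
    rng.shuffle(ids)
    n_test = int(len(ids) * test_fraction)
    test_ids, train_ids = ids[:n_test], ids[n_test:]
    # Training households contribute every week of the year.
    train = [(h, w) for h in train_ids for w in range(n_weeks)]
    # Each test household contributes one arbitrarily chosen week.
    test = [(h, rng.randrange(n_weeks)) for h in test_ids]
    return train, test


train, test = split_year_vs_week(range(10))
```

Because the test week is drawn at random, a classifier cannot rely on cues tied to one season or one particular week.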

First, although coarser temporal granularity generally lowers predictive accuracy, the results show two performance plateaus: accuracy remains stable from 15 minutes to one hour, and again from one day to seven days. These plateaus suggest viable pathways for data minimization that preserve analytical utility.

Second, interpretable handcrafted features and features generated by the tsfresh library perform competitively against embeddings derived from CNN-based autoencoders. In addition, XGBoost consistently outperformed the other classifiers tested.
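As an illustration of what interpretable handcrafted features can look like, the sketch below computes a few summary statistics from one day of hourly load. The feature set, the names, and the 00:00 to 06:00 night window are hypothetical choices for this example and are not the features used in the study.

```python
from statistics import mean, pstdev
from typing import Dict, List


def handcrafted_features(hourly_kwh: List[float]) -> Dict[str, float]:
    """Compute simple, interpretable features from 24 hourly readings."""
    if len(hourly_kwh) != 24:
        raise ValueError("expected exactly 24 hourly readings")
    total = sum(hourly_kwh)
    night = sum(hourly_kwh[0:6])  # assumed night window: 00:00 to 06:00
    # Index of the hour with the highest consumption (first one on ties).
    peak_hour = max(range(24), key=lambda h: hourly_kwh[h])
    return {
        "mean_kwh": mean(hourly_kwh),
        "max_kwh": max(hourly_kwh),
        "std_kwh": pstdev(hourly_kwh),
        "peak_hour": float(peak_hour),
        "night_share": night / total if total > 0 else 0.0,
    }


# Synthetic profile: flat 0.5 kWh per hour with an evening peak at 18:00.
profile = [0.5] * 24
profile[18] = 2.0
features = handcrafted_features(profile)
```

A feature dictionary of this kind could be fed to any tabular classifier, which is what makes the representation easy to inspect.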

Third, an analysis of feature importance reveals a divergence between static and dynamic attributes. Static characteristics, such as dwelling size, can be accurately inferred even from coarse-grained data. In contrast, dynamic behaviors, such as swimming pool usage, necessitate fine-grained temporal signals for reliable identification.

Overall, this research clarifies the trade-off between privacy and utility in smart metering systems. It shows how temporal resolution, feature extraction technique, and classifier choice jointly determine the success of socio-demographic inference.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC
