G2Rec: Tokenizing User Interest Graphs for Scalable Generative Recommendation
A new framework unifies graph-based co-engagement modeling with semantic tokenization to predict user behavior at industrial scale, with direct applications to on-chain intent inference and DeFi personalization.
Generative recommendation is an emerging paradigm that aims to predict users’ next interactions from their historical behaviors, with item tokenization at its core bridging item semantics and recommendation models. However, existing graph-based integration methods, such as graph serialization and graph neural networks, either suffer from scalability issues or exploit only local graph information. Similarly, existing semantic tokenization methods typically rely on heuristics and lack explicit supervision signals, which may lead to inaccurate or suboptimal semantic representations.
To address these limitations, researchers propose G2Rec, a scalable framework that unifies holistic graph-based user co-engagement modeling with semantic tokenization for industrial-scale generative recommendation. By jointly learning from user-item interaction graphs and semantic item tokens, G2Rec enables recommendation models to capture holistic and semantically grounded user interest prototypes without requiring ground-truth user interests. This provides more comprehensive and accurate modeling of user behavior contexts in industrial sequential recommendation. The framework has been deployed online across product surfaces and validated through extensive experiments on public datasets, demonstrating superiority over existing methods.
For the crypto and blockchain domain, G2Rec’s approach offers several concrete implications. First, the generative tokenization of user-item interactions can be directly applied to on-chain transaction sequences. By treating each transaction or smart contract call as a ‘token’ in a generative model, one can predict the next state transition (e.g., next swap, liquidation, or cross-chain message) from historical on-chain data. This could replace heuristic mempool strategies with learned generative models for MEV searchers or intent solvers.
Second, G2Rec’s graph-based co-engagement modeling can scale to millions of wallet addresses without ground-truth labels. DeFi protocols could construct co-engagement graphs of wallet addresses (who transacts with whom, which pools they share) to tokenize ‘user interest contexts’ for personalized liquidations, yield strategies, or credit scoring. This unsupervised approach is particularly valuable in permissionless blockchains where users never explicitly declare their intent.
Third, the semantic tokenization of items into a unified vocabulary can be extended to tokenize smart contract interactions, NFT metadata, or token symbols. This enables a new class of ‘tokenized blockchain state’ models that predict next interactions as a generative language modeling task, with applications in fraud detection, wallet UX, and automated DeFi agents.
Finally, G2Rec’s proven online deployment at industrial scale suggests that similar architectures could be deployed as on-chain or off-chain inference pipelines for a ‘recommendation layer’ in DeFi—e.g., suggesting optimal swap routes, yield pools, or NFT bids based on a wallet’s tokenized on-chain history, with the scalability to handle Ethereum’s full historical transaction volume.
Evidence & Provenance
Every claim is hash-locked to its source span. Click any [N] marker above to verify.
Claim 76 Generative recommendation is an emerging paradigm that aims to predict users' next interactions from their historical behaviors, with item tokenization at its core bridging item semantics and recommendation models.
Generative recommendation is an emerging paradigm that has shown promise in industrial recommendation systems, aiming to predict users' next interactions from their historical behaviors. At the core of generative recommendation lies item tokenization, which bridges item semantics and recommendation models.
260ab39169d9ced6178f36830f24115afba671a2bac907b2566726729860b7a4 Claim 77 Existing graph-based integration methods (graph serialization, graph neural networks) either suffer from scalability issues or exploit only local graph information.
existing graph-based integration methods, such as graph serialization and graph neural networks, either suffer from scalability issues or exploit only local graph information
425317aecd07b962bc22d318e6c2169fb2d5498cf9af02ddb627a818084661c9 Claim 78 Existing semantic tokenization methods typically rely on heuristics and lack explicit supervision signals, leading to inaccurate or suboptimal semantic representations.
existing semantic tokenization methods typically rely on heuristics and lack explicit supervision signals, which may lead to inaccurate or suboptimal semantic representations
4c75f01a444f3fece95dcc0eab3f640b6d5e865fbee5372a604a889b3803ae20 Claim 79 G2Rec is a scalable framework that unifies holistic graph-based user co-engagement modeling with semantic tokenization for industrial-scale generative recommendation.
we propose G2Rec, a scalable framework that unifies holistic graph-based user co-engagement modeling with semantic tokenization for industrial-scale generative recommendation
c692a4ff3205be0de3362576d0fb4ff17c4f673d10a716a234c5cf8d65d86a70 Claim 80 G2Rec enables recommendation models to capture holistic and semantically grounded user interest prototypes without requiring ground-truth user interests.
G2Rec enables recommendation models to capture holistic and semantically grounded user interest prototypes without requiring ground-truth user interests
b386d5911ca201180a1c1cc6d5d027f0fc4fc725ca54ff44aceb11b2cf0bb33a Claim 81 G2Rec provides more comprehensive and accurate modeling of user behavior contexts in industrial sequential recommendation.
thereby providing more comprehensive and accurate modeling of user behavior contexts in industrial sequential recommendation
70addc4a460270e16e73e82a53039086df39db055811cf61be2961272ce36b23 Claim 82 G2Rec has been deployed online across product surfaces and validated through extensive experiments on public datasets, demonstrating superiority over existing methods.
Online deployment across product surfaces and extensive experiments on public datasets demonstrate the superiority of G2Rec over existing methods.
bd857115b0ebe937e0f20ce9d034c8f70de06351aa62661fb583eabc568dbe98