物理与社会
查看 最近的 文章
显示 2025年07月18日, 星期五 新的列表
- [1] arXiv:2507.12585 [中文pdf, pdf, html, 其他]
-
标题: 社会吸引力对随意群体形成的影响:幂律群体规模和抑制渗流标题: The Impact of Social Attractiveness on Casual Group Formation: Power-Law Group Sizes and Suppressed Percolation主题: 物理与社会 (physics.soc-ph) ; 统计力学 (cond-mat.stat-mech) ; 适应性与自组织系统 (nlin.AO)
非正式群体形成的动态过程长期以来一直是社会科学的研究主题。 虽然早期的随机模型为群体规模分布提供了基础性见解,但它们常常简化了个体行为,并缺乏异质性社会吸引力的机制。 在此,我们重新审视了一个由吸引力驱动的交互模型,这是一个基于代理的框架,其中点状代理在二维场地中随机移动,并表现出不同的社会吸引力,从而导致它们靠近高度有吸引力的名人同伴。 我们将该模型与一个空模型进行比较,其中代理持续移动,这类似于随机几何图。 我们的大量模拟结果显示了显著的结构和动态差异:与空模型不同,吸引力驱动模型的平均度数随着系统大小线性增加,而密度固定,导致更紧凑的群体,并抑制了渗流转变。 至关重要的是,尽管空模型的群体规模分布是指数衰减或双峰的,但吸引力驱动模型稳健地表现出幂律分布,$P(n) \propto n^{-2.5}$,其指数与密度无关。 由于长平衡时间,这些发现是通过计算密集型模拟获得的,为该模型提供了详尽的定量表征,突显了个体吸引力在塑造物理空间中的社会聚集中的关键作用。
The dynamics of casual group formation has long been a subject of interest in social sciences. While early stochastic models offered foundational insights into group size distributions, they often simplified individual behaviors and lacked mechanisms for heterogeneous social appeal. Here, we re-examine the attractiveness-driven interaction model, an agent-based framework where point-like agents move randomly in a 2D arena and exhibit varied social appeal, leading them to pause near highly attractive celebrity peers. We compare this model to a null model where the agents are continuously in movement, which resembles a Random Geometric Graph. Our extensive simulations reveal significant structural and dynamic differences: unlike the null model, the attractiveness-driven model's average degree increases linearly with system size for fixed density, resulting in more compact groups and the suppression of a percolation transition. Crucially, while the null model's group size distribution is either exponentially decaying or bimodal, the attractiveness-driven model robustly exhibits a power-law distribution, $P(n) \propto n^{-2.5}$, with an exponent independent of density. These findings, obtained through computationally intensive simulations due to long equilibration times, offer a thorough quantitative characterization of this model, highlighting the critical role of individual attractiveness in shaping social aggregation in physical space.
- [2] arXiv:2507.12812 [中文pdf, pdf, 其他]
-
标题: 形式化非正式沟通:对CERN早期网络前预印本基础设施的考古学研究标题: Formalizing Informal Communication: An Archaeology of the Early Pre-Web Preprint Infrastructure at CERN主题: 物理与社会 (physics.soc-ph)
本文探讨了高能物理领域预印本通信的早期发展,特别是预印本通信在20世纪60年代初在欧洲核子研究中心(CERN)是如何被正式化的。 它采用了一种关于基础设施的社会学概念,以询问当时以及随后如何通过通信的实践和技术结构化了预印本在高能物理领域的使用,并最终固化为研究社区的通信和信息基础设施。 该研究对预印本媒介在三个历史层面的使用和理解进行了媒体考古,以探索成为欧洲核子研究中心预印本基础设施中刻入的社文文化特点:1. 在20世纪初,将预印本作为已被接受发表的论文的单独副本,非正式地分发给学会和学院的成员,以便内容能够立即可用,而不必等待出版延迟,2. 战后物理学中,预印本作为媒介,用于在理论物理快速发展的潮流中私下且非正式地分享实用指导和理论工具,3. 作为在20世纪60年代初欧洲核子研究中心图书馆开发的一种正式的信息系统,该系统将预印本视为面向整个高能物理(HEP)社区的公共当前意识工具,以向其成员通报该领域的最新进展。 文章最后讨论了塑造预印本通信早期互联网前历史的基础设施制度,并展望了全面研究高能物理领域预印本通信基础设施的研究计划。
This article deals with the early development of preprint communication in high-energy physics, specifically with how preprint communication was formalized in the early 1960s at CERN. It employs a sociological conception of infrastructures to ask which practices and technologies of communication structured the use of preprints in high-energy physics at the time and subsequently solidified into the research community's communication and information infrastructure. The investigation conducts a media archaeology of the uses and understandings of the preprint medium in three historical layers to explore the sociocultural specificities that became inscribed into the preprint infrastructure at CERN: 1. The use of preprints as separate copies of papers accepted for publication and distributed informally to members of societies and academies in the early-20th century to make the content immediately available without delay of publication, 2. The use of preprints in postwar physics as media to privately and informally share practical instructions and theoretical tools in the fast-moving current of theoretical physics, and 3. As a formalized information system developed at the CERN library in the early 1960s, which treated preprints as public current awareness tools for the benefit of the whole HEP community to inform its members of recent progress in the field. The article concludes with a discussion of the infrastructural regimes that shaped the early pre-Web history of preprint communication as well as an outlook to the research program to comprehensively study the infrastructuring of preprint communication in high-energy physics.
- [3] arXiv:2507.13005 [中文pdf, pdf, html, 其他]
-
标题: 运输网络增长中直接性与覆盖范围之间的权衡标题: The Trade-Off between Directness and Coverage in Transport Network Growth评论: 31页,21图主题: 物理与社会 (physics.soc-ph)
设计空间网络,如交通网络,通常涉及如何通过一组链接最佳地连接一组位置的问题。 在实践中,以一种有助于网络在增长过程中早期发挥作用的方式对链接进行排序可能是至关重要的,例如在自行车网络中。 然而,不清楚不同的增长过程如何实现这种早期功能结构。 在这里,我们系统地研究了连通平面网络的增长,量化了增长网络结构的功能性。 我们比较了随机增长与各种贪婪和人工设计的、手动增长策略。 我们通过直接性和覆盖率的基本性能指标来评估我们的结果,发现它们之间存在非平凡的权衡。 在两个指标上,人工策略都优于贪婪策略,而随机策略表现最差,并且不太可能是帕累托有效的。 基于中心性的贪婪策略在直接性方面表现最好,但在覆盖率方面比随机策略更差,而基于覆盖率的贪婪策略可以尽可能快地实现最大全局覆盖率,但在直接性方面表现与随机策略一样差。 基于直接性的贪婪策略会陷入局部最优陷阱。 这些结果适用于多种简化的城市交通网络拓扑结构。 我们的见解对于链接添加顺序在空间网络中很重要的应用至关重要,例如在城市或区域交通网络设计问题中。
Designing spatial networks, such as transport networks, commonly deals with the problem of how to best connect a set of locations through a set of links. In practice, it can be crucial to order the implementation of the links in a way that facilitates early functioning of the network during growth, like in bicycle networks. However, it is unclear how this early functional structure can be achieved by different growth processes. Here, we systematically study the growth of connected planar networks, quantifying functionality of the growing network structure. We compare random growth with various greedy and human-designed, manual growth strategies. We evaluate our results via the fundamental performance metrics of directness and coverage, finding non-trivial trade-offs between them. Manual strategies fare better than greedy strategies on both metrics, while random strategies perform worst and are unlikely to be Pareto efficient. Centrality-based greedy strategies tend to perform best for directness but are worse than random strategies for coverage, while coverage-based greedy strategies can achieve maximum global coverage as fast as possible but perform as poorly for directness as random strategies. Directness-based greedy strategies get stuck in local optimum traps. These results hold for a number of stylized urban transport network topologies. Our insights are crucial for applications where the order in which links are added to a spatial network is important, such as in urban or regional transport network design problems.
- [4] arXiv:2507.13010 [中文pdf, pdf, 其他]
-
标题: 量子在音乐节:关于量子科学与技术的展览对音乐节游客的影响标题: Quantum at a Music Festival: the Impact of an Exhibit about Quantum Science and Technologies on Festival Visitors评论: 19页,3图,1表主题: 物理与社会 (physics.soc-ph) ; 物理教育 (physics.ed-ph) ; 量子物理 (quant-ph)
量子技术被视为具有变革性,有可能在药物发现和机器学习等领域带来革命性的变化。 公众参与对于使这些发展与社会需求保持一致并促进接受度至关重要。 本研究测量了2024年Lowlands音乐节上关于量子技术展览的影响(n = 812)。 前后问卷评估了态度、担忧、兴趣和主观知识的变化。 结果显示主观知识有所增加,但兴趣有所下降,这可能是由于新颖性减少或感知难度增加所致。 这些发现强调了展览在非正式环境中作为推广工具的有效性,并突显了在未来的推广工作中保持新颖性和强调量子技术相关性的关键作用。 此外,我们强调评估推广效果的重要性,以确保目标成功实现。
Quantum technologies are seen as transformative, with a potential to revolutionize fields like drug discovery and machine learning. Public engagement is crucial to align these developments with societal needs and foster acceptance. This study measured the impact of an exhibit about quantum technologies at the 2024 Lowlands music festival (n = 812). Pre- and post-surveys assessed changes in attitude, concern, interest and subjective knowledge. Results showed an increase in subjective knowledge but a decrease in interest, possibly due to reduced novelty or increased perceived difficulty. These findings underscore the effectiveness of exhibits as outreach tools in informal settings and highlight the critical role of maintaining novelty and emphasizing the relevance of quantum technologies in future outreach efforts. Additionally, we emphasize the importance of assessing outreach effectiveness to ensure that objectives are successfully achieved.
- [5] arXiv:2507.13068 [中文pdf, pdf, html, 其他]
-
标题: 在线社交网络中的语言气泡标题: Language bubbles in online social networksAlessandro Bellina, Donald Ruggiero Lo Sardo, Emanuele Brugnoli, Fabio Saracco, Pietro Gravino, Vittorio Loreto, Gabriele Di Bona评论: 20页,7图主题: 物理与社会 (physics.soc-ph)
社交媒体平台已成为公共话语的重要空间。 尽管政治极化和不同群体之间的交流有限已被广泛认可,但社会网络碎片化与不同社区使用的语言特征和质量之间的联系却未得到足够的关注。 本研究旨在通过考察意大利在Twitter/X上的辩论的社会结构和语言丰富性来填补这一空白。 我们分析了2018年至2022年间意大利政治人物和新闻机构的推文和转发推文,描述了转发网络,并通过各种词汇度量评估了不同社区中使用的语言。 我们的分析揭示了两种系统性模式:在网络中更接近的社区倾向于使用更相似的词汇,而孤立的社区则始终表现出较低的词汇多样性和丰富性。 这些模式共同说明了我们所谓的“语言气泡”。 这些发现表明,社会隔离的社区与其他社区互动较少,并发展出独特且贫乏的语言特征,突显了社会碎片化与语言分歧之间的结构性联系。
Social media platforms have become essential spaces for public discourse. While political polarisation and limited communication across different groups are widely acknowledged, the connection between social network fragmentation and the language features and quality used by various communities has received insufficient attention. This study aims to fill this gap by examining the social structure and linguistic richness of the Italian debate on Twitter/X. We analyse tweets and retweets from Italian politicians and news outlets between 2018 and 2022, characterising the retweet network and evaluating the language used within different communities through various lexical metrics. Our analysis uncovers two systematic patterns: communities closer in the network tend to use more similar vocabulary, while isolated communities consistently demonstrate lower lexical diversity and richness. Together, these patterns illustrate what we call ``language bubbles''. These findings indicate that socially isolated communities interact less with others and develop distinct and poorer linguistic profiles, highlighting a structural link between social fragmentation and linguistic divergence.
- [6] arXiv:2507.13075 [中文pdf, pdf, html, 其他]
-
标题: 城市热岛效应中日季节性气温迟滞回线的波动模式:来自巴黎和马德里的见解标题: Undulating patterns of Hysteresis loops in diurnal seasonality of air temperature in Urban Heat Island effect: Insights from Paris and Madrid评论: 9页,3图主题: 物理与社会 (physics.soc-ph) ; 数据分析、统计与概率 (physics.data-an)
本研究通过对比分析巴黎和马德里两地的空气温度滞回模式,探讨城市热岛(UHI)效应的动力学特性。这两座主要欧洲城市具有不同的气候和城市特征。利用2008年至2017年以三小时为间隔的高分辨率模拟空气温度数据,我们研究了昼夜和季节性滞回环如何揭示UHI变化的独特性和普遍性方面。位于温带海洋性气候中的巴黎和位于寒冷半干旱区的马德里,在UHI强度、季节分布和昼夜模式方面表现出显著差异。尽管存在这些差异,这两座城市都表现出非常相似的滞回环方向和斜率,这表明太阳辐射和热量储存等时间依赖机制在不同城市背景下基本决定了空气温度的UHI。我们的研究结果强调了在制定有针对性的、具有气候适应能力的城市策略时,需要考虑本地气候和普遍的物理过程。这些结果为基于群体的干预措施和通过滞回模式对城市进行分类提供了途径,以指导城市规划和热缓解工作。
This study examines the dynamics of the urban heat island (UHI) effect by conducting a comparative analysis of air temperature hysteresis patterns in Paris and Madrid, two major European cities with distinct climatic and urban characteristics. Utilizing high-resolution modelled air temperature data aggregated at a fine temporal resolution of three-hour intervals from 2008 to 2017, we investigate how diurnal and seasonal hysteresis loops reveal both unique and universal aspects of UHI variability. Paris, located in a temperate oceanic climate, and Madrid, situated in a cold semi-arid zone, display pronounced differences in UHI intensity, seasonal distribution, and diurnal patterns. Despite these contrasts, both cities exhibit remarkably similar hysteresis loop directions and slopes, suggesting that time-dependent mechanisms such as solar radiation and heat storage fundamentally govern air temperature UHI across diverse urban contexts. Our findings underscore the importance of considering both local climate and universal physical processes in developing targeted, climate-resilient urban strategies. The results pave the way for group-based interventions and classification of cities by hysteresis patterns to inform urban planning and heat mitigation efforts.
- [7] arXiv:2507.13310 [中文pdf, pdf, html, 其他]
-
标题: 在线参与对线下抗议的溢出效应建模:网络上的随机动力学和平均场近似标题: Modelling the spillover from online engagement to offline protest: stochastic dynamics and mean-field approximations on networks评论: 44页,33图主题: 物理与社会 (physics.soc-ph) ; 社会与信息网络 (cs.SI) ; 动力系统 (math.DS) ; 适应性与自组织系统 (nlin.AO) ; 种群与进化 (q-bio.PE)
社交媒体正在改变线下生活的各个方面,从日常决策如用餐选择到冲突的发展进程。 在本研究中,我们提出了一种耦合建模框架,包含一个在线社交网络层,以分析特定主题上的参与如何溢出到线下抗议活动。 我们开发了一个随机模型,并推导了几种不同复杂度的平均场模型。 这些模型使我们能够估计繁殖数并预测活动激增可能发生的时间。 一个关键因素是在线和线下领域之间的传播率;为了出现线下爆发,这一比率必须处于一个临界范围内,既不太低也不太高。 此外,利用合成网络,我们研究了网络结构如何影响这些近似值的准确性。 我们的研究结果表明,低密度网络需要更复杂的近似,而简单的模型可以有效地表示高密度网络。 然而,在两个现实世界的网络上进行测试时,增加复杂度并未提高准确性。
Social media is transforming various aspects of offline life, from everyday decisions such as dining choices to the progression of conflicts. In this study, we propose a coupled modelling framework with an online social network layer to analyse how engagement on a specific topic spills over into offline protest activities. We develop a stochastic model and derive several mean-field models of varying complexity. These models allow us to estimate the reproductive number and anticipate when surges in activity are likely to occur. A key factor is the transmission rate between the online and offline domains; for offline outbursts to emerge, this rate must fall within a critical range, neither too low nor too high. Additionally, using synthetic networks, we examine how network structure influences the accuracy of these approximations. Our findings indicate that low-density networks need more complex approximations, whereas simpler models can effectively represent higher-density networks. When tested on two real-world networks, however, increased complexity did not enhance accuracy.
新提交 (展示 7 之 7 条目 )
- [8] arXiv:2507.12495 (交叉列表自 physics.geo-ph) [中文pdf, pdf, 其他]
-
标题: 评估空间天气缓解投资决策的经济效益:来自新西兰的证据标题: Assessing the economic benefits of space weather mitigation investment decisions: Evidence from Aotearoa New Zealand主题: 地球物理 (physics.geo-ph) ; 系统与控制 (eess.SY) ; 等离子体物理 (physics.plasm-ph) ; 物理与社会 (physics.soc-ph) ; 空间物理 (physics.space-ph)
空间天气事件对现代经济构成日益增长的威胁,但其宏观经济影响仍研究不足。 本研究首次针对奥特亚罗瓦新西兰的地磁风暴影响进行了专门的经济评估,量化了由于极端日冕物质抛射(CME)导致的七种中断和缓解情景下的潜在GDP损失。 主要关注地磁感应电流(GICs)对电力传输网络的破坏性影响。 目标是通过提供其潜在经济效益的一阶近似值,支持围绕空间天气缓解投资的决策。 我们发现,在没有缓解措施的情况下,一次严重但现实的风暴可能导致高达83.6亿新元的GDP损失,其中一半以上来自连锁供应链效应。 然而,即使较不严重的场景也会造成超过30亿新元的损失。 重要的是,以研究为导向的操作策略,如优化切换和孤岛运行,仅需花费50万新元即可避免高达3.7亿新元的损失,带来740比1的效益成本比率。 此外,物理防护措施如GIC阻断设备可将中断降低至11.2亿新元,避免的GDP损失高达23亿新元,效益成本回报率高达80比1。 当考虑到未建模的影响,包括数十亿资本设备损失和长期收入损失时,预先缓解的经济合理性变得更加重要。 未来的研究需要将资本和收入损失的建模整合到战略重要的工业设施中。
Space weather events pose a growing threat to modern economies, yet their macroeconomic consequences still remain underexplored. This study presents the first dedicated economic assessment of geomagnetic storm impacts on Aotearoa New Zealand, quantifying potential GDP losses across seven disruption and mitigation scenarios due to an extreme coronal mass ejection (CME). The primary focus is upon the damaging impacts of geomagnetically induced currents (GICs) on the electrical power transmission network. The goal is to support decision-making around space weather mitigation investments by providing a first-order approximation of their potential economic benefits. We find that in the absence of mitigation, a severe but realistic storm could result in up to NZ\$8.36 billion in lost GDP, with more than half stemming from cascading supply chain effects. Yet, even less severe scenarios incur losses exceeding NZ\$3 billion. Importantly, research-led operational strategies, such as optimized switching and islanding, can avoid up to NZ\$370 million in losses for as little as NZ\$500,000 in expenditure, delivering a benefit-cost ratio of 740 to 1. Moreover, physical protections such as GIC blocking devices further reduce disruption to as low as NZ\$1.12 billion, with avoided GDP losses up to NZ\$2.3 billion, and benefit-cost returns up to 80 to 1. When also acknowledging unmodelled impacts, including multi-billion losses in capital equipment and long-term revenue, the economic rationale for pre-emptive mitigation becomes even more pertinent. Future research needs to integrate the modelling of capital and revenue losses for strategically important industrial facilities.
交叉提交 (展示 1 之 1 条目 )
- [9] arXiv:2505.22802 (替换) [中文pdf, pdf, html, 其他]
-
标题: 从有符号网络到群图标题: From Signed Networks to Group Graphs评论: 54页,包括附录中的13页。第2版增加了更多的应用,并新增了关于电压图和增益图的参考文献。主题: 物理与社会 (physics.soc-ph) ; 离散数学 (cs.DM) ; 社会与信息网络 (cs.SI)
我定义了一个“群图”,它编码了网络上动态过程中的对称性。 群图扩展了带符号网络,其中边用加号或减号标记,通过允许来自任何群的边标签,并推广了标准的平衡概念。 我证明,在平衡群图上的过程中,时间演化完全由网络拓扑决定,而不是由群结构决定。 这统一并扩展了最近关于带符号网络(Tian & Lambiotte, 2024a)和复杂网络(Tian & Lambiotte, 2024b)的研究成果。 我还将讨论此处的结果与相关工作联系起来,例如Harary(1982)的“群图”、Gross(1974)的“电压图”和Zaslavsky(1989)的“增益图”。 最后,我将回顾一些网络动力学和对称驱动建模的有前途的应用,包括地位、带有零标签的边、弱平衡、不平衡群图以及使用独异半群。
I define a "group graph" which encodes the symmetry in a dynamical process on a network. Group graphs extend signed networks, where links are labelled with plus or minus one, by allowing link labels from any group and generalising the standard notion of balance. I show that for processes on a balanced group graph the time evolution is completely determined by the network topology, not by the group structure. This unifies and extends recent findings on signed networks (Tian \& Lambiotte, 2024a) and complex networks (Tian \& Lambiotte, 2024b). I will also relate the results discussed here to related work such as the "group graph" of Harary (1982), a "voltage graph" (Gross, 1974) and a "gain graph" (Zaslavsky 1989). Finally, I will review some promising applications for network dynamics and symmetry-driven modelling including status, edges with a zero label, weak balance, unbalanced group graphs and using monoids.
- [10] arXiv:2506.20572 (替换) [中文pdf, pdf, html, 其他]
-
标题: 一种生物物理方法用于通信系统的网络设计标题: A biophysical approach to the design of networks of communication systems评论: 7页主题: 物理与社会 (physics.soc-ph) ; 适应性与自组织系统 (nlin.AO)
受原生动物\textit{多核黏菌}生长动力学的启发,我们采用了一种形式化方法,用于描述在通道网络上的适应性、不可压缩的哈根-泊肃叶流动,以识别欧几里得空间中连接不同节点的图。这些图相对于其长度来说要么是次优的,要么是最优的。有时,我们会推导出与史特林树拓扑等价的图树结构。这种方法可以用于辅助决策通信网络的设计,例如光纤网、高速公路或铁路网络。作为该方法实用性的展示,我们明确地将这一框架应用于葡萄牙铁路网络。
Inspired by the growth dynamics of the protist \textit{Physarum polycephalum}, we employ a formalism that describes adaptive, incompressible Hagen-Poiseuille flows on channel networks to identify graphs connecting different nodes within Euclidean space. These graphs are either suboptimal or optimal with respect to their length. Occasionally, we derive graph tree configurations that are topologically equivalent to Steiner trees. This methodology can be utilised to assist in making decisions regarding the design of communication networks, such as fibre webs, motorways, or railway networks. As a demonstration of the practicality of this approach, we explicitly apply this framework to the Portuguese railway network.
- [11] arXiv:2501.05365 (替换) [中文pdf, pdf, html, 其他]
-
标题: 非稳态尾部在动力学流行病模型中的控制标题: Control of Overpopulated Tails in Kinetic Epidemic Models主题: 优化与控制 (math.OC) ; 适应性与自组织系统 (nlin.AO) ; 物理与社会 (physics.soc-ph) ; 种群与进化 (q-bio.PE)
我们为数学流行病学中的受控分 compartmental 模型引入基于模型的转移率,重点研究控制策略对描述接触形成动力学的相互作用多智能体系统的影响。 在动能控制问题的框架下,我们比较两种典型的控制协议:一种是直接影响动力学的加性控制,另一种是针对智能体之间相互作用强度的控制。 对于 SIR 分 compartmental 化,推导出出现的受控宏观模型,以说明其对流行病进展和接触相互作用动力学的影响。 数值结果表明,这种方法在引导动力学和控制流行病趋势方面的有效性,即使在接触分布表现出过度填充尾部的情况下也是如此。
We introduce model-based transition rates for controlled compartmental models in mathematical epidemiology, with a focus on the effects of control strategies applied to interacting multi-agent systems describing contact formation dynamics. In the framework of kinetic control problems, we compare two prototypical control protocols: one additive control directly influencing the dynamics and another targeting the interaction strength between agents. The emerging controlled macroscopic models are derived for an SIR compartmentalization to illustrate their impact on epidemic progression and contact interaction dynamics. Numerical results show the effectiveness of this approach in steering the dynamics and controlling epidemic trends, even in scenarios where contact distributions exhibit an overpopulated tail.
- [12] arXiv:2502.02386 (替换) [中文pdf, pdf, html, 其他]
-
标题: 通过超边复制的超图链接预测标题: Hypergraph Link Prediction via Hyperedge Copying主题: 社会与信息网络 (cs.SI) ; 适应性与自组织系统 (nlin.AO) ; 数据分析、统计与概率 (physics.data-an) ; 物理与社会 (physics.soc-ph)
我们提出了一种时间演化的超图生成模型,其中超边通过复制之前的超边而形成。 我们的模型能够再现许多经验超图中的几种典型事实,可以从数据中学习,并在完整的超图上定义似然,而不是基于自我的或其他子超图。 分析我们的模型,我们得出了节点度、边大小和边交集大小分布的描述,这些分布是根据模型参数得出的。 我们还展示了经验超图的一些特征,这些特征是或不是被我们的模型成功捕捉的。 我们提供了一个可扩展的随机期望最大化算法,可以将我们的模型拟合到包含数百万个节点和边的超图数据集上。 最后,我们在超图链接预测任务上评估了我们的模型,发现仅使用11个参数的模型实例就可以与大型神经网络达到具有竞争力的预测性能。
We propose a generative model of temporally-evolving hypergraphs in which hyperedges form via noisy copying of previous hyperedges. Our proposed model reproduces several stylized facts from many empirical hypergraphs, is learnable from data, and defines a likelihood over a complete hypergraph rather than ego-based or other sub-hypergraphs. Analyzing our model, we derive descriptions of node degree, edge size, and edge intersection size distributions in terms of the model parameters. We also show several features of empirical hypergraphs which are and are not successfully captured by our model. We provide a scalable stochastic expectation maximization algorithm with which we can fit our model to hypergraph data sets with millions of nodes and edges. Finally, we assess our model on a hypergraph link prediction task, finding that an instantiation of our model with just 11 parameters can achieve competitive predictive performance with large neural networks.