阅读视图

观澜·驻访学者沙龙（第14期）回顾 | 人工智能与人类智能

2026年4月17日 15:04

2026-04-17 15:04 广东

4月12日，观澜·驻访学者沙龙（第14期）在我校深圳校区文学园5栋103致用·观澜学术会议室顺利举行。本次研讨会由我院2026年春季学期校内驻访学者周国梅教授和刘虎教授共同召集，汇聚心灵哲学、逻辑学、心理学、智能工程及产业界的专家学者，深度剖析人工智能（AI）与人类智能（HI）的本质差异，探讨人类认知研究在人工智能发展中的应用价值，探索人机协作的优化模式、人机融合社会的未来图景。

周国梅教授、刘虎教授作引言

人类智能

在中山大学心理学系代政嘉教授主持的“人类智能”主题研讨中，研究者们从实验心理学与神经科学视角审视了人类认知的独特性。

中山大学心理学系钱洁慧教授通过三维空间工作记忆的研究发现，人类智能在处理空间信息时展现出显著的特异性。她认为，物体间的空间关系、远近距离及可操作性均会影响信息的编码与表征，这提示了人类的信息加工机制由具身经验塑造，这一点可能与当前AI的信息处理逻辑存在差异。

中山大学心理学系岳珍珠教授则聚焦于跨通道视听整合的认知机制。她介绍，在单通道信息较弱时，多通道整合的神经元反应模式比单通道信息引发的效应更强，这一发现为理解人类感知系统的复杂性和适应性提供了新视角。

中山大学心理学系林盈教授以“以脑‘知’脑：脑科学与AI的双向赋能初探”为题，分享了脑科学与AI融合的前沿探索。报告围绕"AI for Brain"与"Brain for AI"两条主线展开：利用AI技术建模人脑结构-功能耦合关系，揭示人脑异质性通信机制；同时借助AI模型基于脑功能活动重建视觉图像，探索脑科学理论发现如何提升AI模型。该报告展现了脑科学与AI协同创新的广泛前景，引起了与会专家对这一领域的热烈探讨。

左右滑动查看第一场研讨现场

人类智能与人工智能

中山大学心理学系王琪教授主持的“人类智能与人工智能”主题研讨中，与会专家重点探讨了AI在医疗手术、智能驾驶及工业设计等领域的创新应用和落地实践。

中国科学院深圳先进技术研究院贾富仓研究员介绍了具身智能人机协同手术机器人的最新进展。报告了解剖结构、动态环境感知和增强现实导航对于提升手术安全性和精准性的临床验证效果，提出结合仿真数据强化学习和临床数据模拟学习提升机器人的自主性。

中山大学心理学系何子静副教授分享了自动驾驶混合交通中的人机交互研究。她认为，未来的自动驾驶不应仅追求工程效能，更应关注拟人化感知与动态博弈，通过适度让渡控制权来增强人类的心理接受度。

深圳人因工程技术研究院李海波院长从人因工程角度介绍了复杂作业系统人机交互的挑战。他强调，智能系统设计需充分考虑用户认知模型的预测价值，并展示了人因工程在核电应急仿真、数字化界面设计及海洋立体生态评估大模型中的应用前景。

中山大学心理学系罗思阳副教授则从文化组学视角切入，探讨了AI大模型的高维心理结构、道德价值观和文化价值观，及其在人机交互中的刻板印象与偏见，并通过交叉融合表征相似性建模、个体行为建模、社会演化建模，检验了未来导向、命运共同体等多种调控AI大模型交互行为的策略，解析了未来人-AI多智能体融合社会的演化趋势。

左右滑动查看第二场研讨现场

交流讨论

在自由讨论环节，香港中文大学哲学系张寄冀教授、香港中文大学政务与政策科学学院詹晶教授、香港大学哲学系王康予讲师、中山大学马克思主义学院李珍教授、中山大学逻辑与认知研究所文学锋教授、中山大学外国语学院徐翌茹教授，中山大学心理学系丁如一副教授、中山大学历史学系李智副教授、深圳大学心理学院黄丽芹助理教授等与会学者踊跃发言，围绕智能的能动性、情绪智能、心理健康、具身智能、群体智能、自我意识、政策制定等核心议题展开了深入探讨，分享了各自的前沿见解。

左右滑动查看交流讨论现场

总结

周国梅教授和刘虎教授在总结发言中表示，本次“人工智能与人类智能”学术沙龙取得了丰硕成果，成功搭建了横跨人文、社科、工程、产业等多领域的跨界交流平台，有效促进了哲学思辨、心理科学、工程技术与产业应用的深度对话与思想交融。同时，他们强调，人工智能的发展不应仅追求技术突破，更应注重人文关怀，期待未来能够持续举办此类活动，不断推动人工智能领域向着更具人性温度的方向演进，共同探索人机协同、和谐共生的未来发展路径。

研讨会合影留念

内容来源：中山大学人文社会科学研究院

编辑：陈旖旎

初审：蔡一峰

复审：林耿、陈诗诗

审定发布：张伟

阅读原文

跳转微信打开

10 - 武汉大学文化遗产公众号

【光明日报】余来明王玉珏｜以文化遗产讲好中华文明故事

2026年4月17日 19:25

2026-04-17 19:25 美国

讲好文化遗产的中国故事，可以让世界更好了解中华文明几千年延续不辍的文化基因、精神系谱，提供给世界多样文明治理、文化发展的方案和模式，从而更好地书写中华文明未来的世界篇章。

👆👆👆点击蓝字 · 关注我们

文化遗产是每一个民族在文明发展过程中留下的宝贵财富。纵观人类文明发展历程，无论文明存续抑或消失，通过传之于世的文化遗产，人们总能由此想象其在文明历史星空中所呈现的形态和图景。中华文明作为世界历史上“连续性文明”的典范，在五千多年的发展历程中，创造了辉煌灿烂的文明成果，为世界文明发展贡献了数量丰富的文化遗产。当下世界文明的发展正呈现多样化格局，中华文明作为其中重要一支，也在一定程度上面临着来自各方面的多重挑战。中华文明绵延不辍的历史表明，文明发展的未来不仅在于做到丰盈自身，树立具有主体性的文明意识，探索推动文明不断进步的动力和方向，还在于与世界诸文明在交流交往中相互借鉴、共同发展。文化遗产作为一种兼具历史性与当代性的文明记忆，为讲好五千多年中华文明故事提供了重要的文化载体和媒介。

文化遗产蕴含五千多年中华文明绵延生长的历史基因

中华文明有着五千多年的悠久历史，是人类历史上迄今为止唯一未曾断裂的文明形态。中华民族创造了辉煌的文明成就，留下了无数珍贵的历史文化遗产，并以之影响了周边地区和世界各地。其间无论是承载中华文明杰出创造的物质遗产，还是反映中华文明优秀特质的精神遗产，都形成于几千年中华文明的悠久历史进程当中，是凝结无数先辈心血和智慧的优秀文明实践成果。那些保存至今的文化遗产，是我们进入中华几千年活态历史的重要通道，其中蕴藏着中华文明绵延生长的历史基因和密码。遗产见证历史，是“传统的集合”，是现实世界与过去时代最紧密、最直接的联系。当我们近距离触摸、感知文化遗产，打开的是通向中华文明悠久历史、璀璨成就的广阔世界。

文化的发展、文明的进步需要源头活水。《周易》说：“穷则变，变则通，通则久。”此论不仅道出了世间万物的存续之道，同时也指出了人类文明的兴亡之道。人类历史上曾出现诸多伟大的文明形态，最后又大多消失在了历史长河中。唯有中华文明一直延续至今，其中一个重要特点就是能够吸收、容纳各不同形态文明的优秀基因，以为自身文明的发展提供永续动力。其中既有吸收外来文明以实现转化的容纳中外，也有得自于传承创新而生的古今转化。文化遗产作为各历史时期文化发展、文明演进的重要标识，是呈现五千多年中华文明发展成就的鲜活载体。文化遗产不只关乎中华文明的历史和过去，更与中华文明的现在和未来密切相关。保护、传承、利用好文化遗产，就是要立足几千年文明赓续的历史脉络，深入挖掘中华文明绵延不绝的基因密码，以大众可知可感的方式讲好中华民族几千年绵延发展的文明故事，发掘其中所蕴含的中国智慧和中国精神。

传承发展中华文明要落实在保护传承、活化利用文化遗产行动中

文化遗产是几千年中华文明发展给世界留下的宝贵财富，是中华民族向世界展现悠久文明历史的重要窗口。相较于世界其他诸文明，中华文明具有突出的连续性、创新性、统一性、包容性与和平性。这些特征不仅见于历史时期诸多观念、意识的表述，也具体呈现于各种物质、非物质的文化遗产当中。以往对于中华文明观念、意识层面的内容、特征分析较多，而对于如何将几千年中华文明的优秀基因外化于当代世界则实践较少。每一民族文明的延续，都建立在对自身文化的反思、总结、革新、完善的基础之上。当下随着数智时代的到来，技术变革正在不断推动文化的革新，也对弘扬传承中华文明提出了新的要求。文化遗产作为中华文明悠久历史的结晶，其保护传承、活化利用在数字化的时代趋势下也在不断重塑，面临着现代技术和古老文明之间的多重挑战与复杂的融合重生过程。

文化遗产作为一种活态的文化，承载的不仅是历史时期文明演进的记忆，也是民族性格、精神传承的重要载体。每一民族文明的传承、文化的发展，都是在原有文明脉络的基础上赓续变革而来。中华文明在几千年发展过程中，善于总结既往的文明成果并推陈出新，由此所凝定形成于不同时期的文化遗产，便不只是一种物质性或者精神性的遗存，还是反映中华民族伟大精神和智慧的历史结晶。如何让文物古迹、古老建筑、名城名镇、历史街区、传统村落、文化景观、非遗民俗等文化遗产在现代世界重新焕发生命力，让古老的中华文明焕发新的活力，是中华文明绵延发展所需要面临的重要课题。从此出发，文化遗产的保护传承、活化利用便不仅是文化资源的开发、利用或者产业化，还是要在中华文明不断前进的历程中，承担起弘扬中华文化价值、传承中华文明基因的重任。

文化遗产保护传承是中华文明参与世界文明对话、确立文明自主性的重要基础

当今世界正进入一个多元并存的时代，各不同文明之间既相互合作又相互竞争，诸不同形态文明的多样共存、互鉴对话、协同发展成为世界文明演进的大势。中华文明作为世界多元文明中的重要一支，在历史上曾长期居于领先地位，至近代遭遇巨大挫折。如今，中华民族伟大复兴势不可挡。在不断推进中国式现代化进程中，面对“世界百年未有之大变局”，中华文明、中华文化如何做到重塑辉煌，找准文明发展的坐标和方向尤为关键，其间不仅包含面向自身文化建设、文明发展的历史传承与创新转化，也包括应对参与世界文明对话、确立文明自主性的全球文明新观念。

文化遗产作为每一文明呈现于世界的具象表达，是各不同文明参与世界对话的重要基础。文化遗产的保护传承是人类文明发展留给这个世界的共同命题。从联合国教科文组织成立以来出台的各种文化遗产保护法律和规则，到各国规范文化遗产保护的相关法规，这都为文化遗产的保护传承提供了制度保障。这不仅是一种面向过去文明记忆的保存，也是一种面向未来文明图景的探索。基于文化遗产保护传承所呈现的世界文明多样性，可以为不同文明间相互尊重、交流对话提供重要媒介，这是消除世界各民族“误解的海洋”最重要的方式之一。中华文明走向世界的过程中，文化遗产成为展现悠久历史、深厚底蕴、丰富内涵、伟大精神的重要窗口。讲好文化遗产的中国故事，可以让世界更好了解中华文明几千年延续不辍的文化基因、精神系谱，提供给世界多样文明治理、文化发展的方案和模式，从而更好地书写中华文明未来的世界篇章。

（作者：余来明、王玉珏，分别系武汉大学中国传统文化研究中心教授；武汉大学文化遗产智能计算实验室教授）

本文来源：《光明日报》2026 年 4 月 11 日

<<<END>>>

来源｜武汉大学中国传统文化研究中心

武汉大学人文社科楼

阅读原文

跳转微信打开

20 - 意大利人文信息学与数字文化协会（AIUCD）

Reminder: Date importanti AIUCD 2026

作者christiandagata

2026年4月18日 03:18

Si ricorda a tutti coloro che intendono partecipare ad AIUCD 2026 le seguenti scadenze improrogabili:

Autori

Iscrizione al Convegno: entro il 19 aprile 2026
Invio della versione camera ready: entro il 22 aprile 2026

Uditori

Iscrizione al Convegno: entro il 17 maggio 2026

L’iscrizione è obbligatoria per la pubblicazione del contributo negli atti e per la partecipazione come uditori.

L'articolo Reminder: Date importanti AIUCD 2026 proviene da AIUCD.

12 - 南京师大比特人文公众号

2026年“我用AI校古籍” | 我是“校书官”古籍大众智能整理计划启动啦

2026年4月18日 09:01

徐惠 2026-04-18 09:01 江苏

2026年AI校古籍计划启动，邀公众参与古籍数字化整理，设奖励与证书激励参与。

转载自“数字人文开放实验室”

比特人文

投稿邮箱：dhbase@126.com

扫码关注获取更多资讯

阅读原文

跳转微信打开

5 - 武汉大学数字人文公众号

学术前沿丨《数字人文学刊 (DSH)》2026年第1期论文荐读

2026年4月18日 08:30

2026-04-18 08:30 湖北

本期内容将选取《数字人文学刊》2026年第1期（Volume 41,Issue 1,April 2026）中的10篇论文进行介绍。

《数字人文学刊》（Digital Scholarship in the Humanities，简称DSH）是一本国际性的、同行评审的期刊，发表关于人文学科中所有数字学术方面的原创研究，包括但不限于当前被称为数字人文学科的领域。该期刊主要发表长篇和短篇论文报告，理论、方法、实验和应用研究以及书评等。本期内容将选取《数字人文学刊》2026年第1期（Volume 41,Issue 1,April 2026）中的10篇论文进行介绍。

识读不可读文本：基于图像转文本语言模型构建 19 世纪英文报纸数据集

Reading the unreadable: creating a dataset of 19th century English newspapers using image-to-text language models

Jonathan Bourne

摘要：奥斯卡・王尔德曾言：“文学与新闻的区别在于，新闻不堪卒读，文学无人问津。” 遗憾的是，王尔德所处的 19 世纪数字化存档新闻文本，其光学字符识别（OCR）结果往往缺失或质量低劣，既降低了档案的可获取性，也使其在象征意义与字面意义上均成为 “不可读文本”。为解决这一问题，本研究采用预训练图像转文本语言模型 Pixtral 12B，对包含 8.4 万页 19 世纪英文报纸与期刊的《19世纪连续出版物典藏》（NCSE）开展光学字符识别。研究将该模型与四种主流OCR方法进行性能对比，结果显示其字符错误率中位数仅为1%，较次优模型低80%。本研究构建的NCSE2.0数据集优化了文章识别效果，具备高质量字符识别文本，并将文本划分为4个类别与17个主题，包含140万条条目与3.21亿个词汇。研究通过主题相似度、可读性与事件追踪分析验证了数据集的应用价值。该数据集免费开放，旨在助力历史学与社会学研究，让21世纪的读者得以足不出户识读这份“不可读文本”，共情王尔德对19世纪新闻水准的遗憾。

Abstract：Oscar Wilde said, ‘The difference between literature and journalism is that journalism is unreadable, and literature is not read’. Unfortunately, the digitally archived journalism of Oscar Wilde’s 19th century often has no or poor quality Optical Character Recognition (OCR), reducing the accessibility of these archives and making them unreadable both figuratively and literally. This paper helps address the issue by performing OCR on ‘The Nineteenth Century Serials Edition’ (NCSE), an 84k-page collection of 19th-century English newspapers and periodicals, using Pixtral 12B, a pre-trained image-to-text language model. The OCR capability of Pixtral was compared to four other OCR approaches, achieving a median character error rate of 1%, 5x lower than the next best model. The resulting NCSE v2.0 dataset features improved article identification, high-quality OCR, and text classified into four types and seventeen topics. The dataset contains 1.4 million entries, and 321 million words. Example use cases demonstrate analysis of topic similarity, readability, and event tracking. NCSE v2.0 is freely available to encourage historical and sociological research. As a result, 21st-century readers can now share Oscar Wilde’s disappointment with 19th-century journalistic standards, reading the unreadable from the comfort of their own computers.

图：边界框 “CLD-1853-07-30_page_2_B0C2R7” 分割为三个子框，重叠区域以深蓝色标注

Figure：Bounding box ‘CLD-1853-07-30_page_2_B0C2R7’ is split into three boxes. Overlap is shown as darker shades of blue.

非物质文化遗产剪纸纹样的数字化提取与分割

Digital extraction and segmentation of intangible cultural heritage paper-cut patterns

Daoling Chen , Pengpeng Cheng

摘要：剪纸艺术是中华传统文化中独具特色的非物质文化遗产，当前正面临传承与发展的双重困境。随着剪纸艺人离世、作品损毁流失，部分剪纸品类濒临消亡，其数字化保护工作迫在眉睫。本研究基于改进遗传算法自适应优化Canny算子阈值，结合Grab-Cut算法，实现非遗剪纸纹样的智能化提取与分割。研究首先通过双边滤波对采集的剪纸图像进行平滑处理，提升图像质量；其次采用改进遗传算法优化后的Canny算子，提取剪纸纹样整体轮廓；随后设计Grab-Cut算法，针对性分割剪纸设计元素轮廓，并借助CDR软件处理矢量图像，生成独立可编辑矢量文件；最后通过多种算法对比不同类型剪纸图像的轮廓提取效果。实验结果表明，本研究提出的方法可有效检测剪纸图像纹样真实边缘，完成轮廓提取，纹样各设计元素的像素分割精度达96%以上，为非遗剪纸艺术的数字化保护与创新应用提供了新方法。

Abstract：As a unique intangible cultural heritage in Chinese traditional culture, paper-cut art is now facing the dilemma of inheritance and development, and with the death of paper-cut artists and the damage and loss of paper-cut works, some paper-cut types also disappear. Therefore, the digital protection of paper-cut art is urgent. This research is based on the improved genetic algorithm adaptive optimization of Canny operator threshold and Grab-Cut algorithm to achieve intelligent extraction and segmentation of intangible cultural heritage paper-cut patterns. First, the collected paper-cut images are smoothed by bilateral filtering to improve the image quality. Second, based on the Canny operator optimized by the improved genetic algorithm, the overall contour of the paper-cut pattern is extracted. Then, the Grab-Cut algorithm is designed to segment the contours of decoupage design elements in a targeted way, and the vector image is processed by CDR software to obtain an independent editable vector image. Finally, the contour extraction experiments of different kinds of paper-cut images are compared by different algorithms. The results show that the method proposed in this article can effectively detect the true edge of the pattern in paper-cut images and complete the extraction of the pattern contour, and the accuracy of the segmentation pixels of each design element of paper-cut pattern is greater than 96 per cent. It provides a new method for the digital protection and innovative application of intangible cultural heritage paper-cut art.

图：剪纸图像平滑去噪处理 (a) 原始图像；(b) 增强后图像

Figure：Smoothing and denoising of paper-cut images. (a) Original and (b) enhanced.

丈量东方之声：华语流行歌词共性与特质的数字人文研究

Measuring sounds from the East: digital approaches to commonality and specificity in Chinese Mandarin pop lyrics

Zheyuan Dai , Haitao Liu

摘要：1978年改革开放以来，中国流行音乐产业高速发展，成为全球当代流行音乐的重要组成部分。本研究采用数字人文方法，对1978-2019年当代华语流行歌词开展系统性分析。研究运用文体计量指标布塞曼系数，从静态与动态双维度揭示了歌词文本的高活跃度，佐证了其叙事性的文体本质。歌词情感分析结果显示，文本整体呈现积极基调，而2000年成为重要分水岭，前后两个时期的情感特征呈现显著分化。主题建模分析进一步表明，不同时期歌词的主题分布模式呈现多元化特征，既体现了流行文化的普世性，也彰显了文化与时代语境塑造下的中国本土特质。

Abstract：Since China’s reform and opening-up in 1978, its popular music industry has experienced rapid development and emerged as a significant component of global contemporary pop music. This study conducts a systematic analysis of contemporary Chinese Mandarin pop lyrics (1978–2019) with digital methodologies. A stylometric measurement, the Busemann coefficient, was employed to demonstrate the lyrics’ high textual activity from both static and dynamic perspectives. This finding supports the lyrics’ stylistic essence as narratives. Besides, sentiment analysis of lyrics reveals an overall positive tone, though a significant watershed occurred around the year 2000, marking a divergence between the two periods. Furthermore, analyses with topic modeling demonstrate diversified topic distribution patterns across these periods, reflecting both the universality of popular culture and the distinctive Chinese characteristics shaped by cultural and temporal contexts.

图：潜在狄利克雷分配模型评估：一致性与困惑度对比

Figure：LDA model evaluation: Coherence vs. Perplexity.

文本可视化视域下江南禅诗对高丽汉诗的跨文化影响研究

Exploring the cross-cultural influence of song Jiangnan Chan Buddhist poetry on Goryeo Han poetry through text visualization

Yaqin Fu , Rongrong Fu , Linfeng Li

摘要：本研究选取三位代表性诗人，从词汇意象、时空解构、主题演变与情感表达四个维度，对比分析江南禅诗对高丽汉诗的影响。研究构建基于文本可视化的研究方法，融合词频 - 逆文档频率关键词提取、潜在狄利克雷分配主题建模与情感分析技术，辅以AI大模型新词挖掘与Kimi智能助手分词功能开展研究。研究得出四项核心结论：其一，两类诗歌均以自然与禅意为核心，共享相似的词汇与意象体系；其二，二者均体现永恒视角与历史意识的交织，以及宇宙宏大性与人世现实性的辩证关系；其三，主题演变层面，江南禅诗“自然与禅”的核心内核，在高丽汉诗中转化为“护国、弘法、济世”的三位一体框架；其四，江南禅诗侧重超脱与空寂的审美表达，高丽汉诗则融入世俗生活书写与历史记忆叙事。研究同时证实，该方法可有效提升古典诗歌文本的中文分词精度，彰显了量化对比分析在禅诗跨文化传播与嬗变可视化研究中的核心价值。

Abstract：This study examines the influence of Jiangnan Chan poetry on Goryeo Han poetry through a comparative analysis of three representative poets from four dimensions: vocabulary imagery, spatiotemporal deconstruction, thematic evolution, and emotional expression. It proposes a text visualization-based methodology employing Term Frequency–Inverse Document Frequency (TF-IDF)-based keyword extraction, Latent Dirichlet Allocation (LDA)-based topic modeling, and sentiment analysis, supplemented by neologism mining and Kimi Intelligent Assistant (KIMI)-based word segmentation using AI large models. The results reveal four key findings: (1) both poetries share similar vocabulary and imagery centered on nature and Chan Buddhism; (2) both reflect an interweaving of eternal perspective and historical consciousness, alongside a dialectic between cosmic vastness and human reality; (3) in terms of thematic evolution, the original focus on “Nature and Chan” in Jiangnan poetry was transformed in Goryeo Han poetry into a trinitarian framework of “national protection, Dharma propagation, and social salvation”; and (4) Jiangnan poetry emphasizes transcendence and emptiness, while Goryeo Han poetry incorporates depictions of secular life and historical memory. The findings also suggest that the proposed methodology is effective in improving the accuracy of Chinese word segmentation for ancient poetry texts. This study underscores the value of quantitative comparative analysis in visualizing the intercultural diffusion and transformation of Chan Buddhism poetry.

图：禅诗名词词云图 (a)雪窦重显；(b)杨杰；(c)义天

Figure：Noun word clouds of Chan Buddhism poetry: (a) Xuedou Chongxian; (b) Yang Jie; (c) Yitian.

近代早期英语的善恶语义空间：道德二元对立的计算语言学研究

The semantic space of good and evil in early modern English: a computational study of moral contrasts

Michael Gavin , Michael Witmore

摘要：本文提出一种创新研究方法，通过近代早期英语词向量的几何分析解读文化内涵。研究基于17世纪《早期英文图书在线》（EEBO-TCP）语料库，构建语义模型，梳理3.2万余个词汇的道德关联与性别属性。研究依托分布概念分析方法，设定善恶、女性-男性两大启发式坐标轴，将词汇映射至坐标轴中，挖掘语言使用内嵌的规范结构。情境、离散度与斜率等统计指标，揭示了概念与性别道德化宏观范式的适配规律。研究发现，多数词汇集中于“女性/邪恶”象限，而其语义邻域却常向抽象化、男性化与正向语义域倾斜。本研究提出全新指标——道德对比分数，可无监督挖掘文本潜在二元对立关系，识别塑造近代早期话语体系的文化维度，涵盖情感表达、信任机制、社会教化与领土合法性等范畴。研究勾勒出层次丰富的道德语义图景：道德对立并非静态固化，而是相互重叠、动态流变；该模型打破了固定二元对立的认知框架，证实语言是构建与重塑文化价值的动态高维场域。

Abstract：This article introduces a novel approach to understanding cultural meaning through the geometric analysis of word vectors in early modern English. Using a seventeenth-century corpus Early English Books Online (EEBO-TCP), the study constructs semantic models that chart the moral and gendered associations of over 32,000 words. Building on methods from distributional concept analysis, the authors define heuristic axes—good-evil and feminine-masculine—and project words onto these axes to explore normative structures embedded in language use. Statistical metrics such as situation, spread, and slope reveal how concepts align with broader patterns of moralized gender. Notably, most terms cluster in the ‘feminine/evil’ quadrant, yet semantic neighborhoods often slope toward abstract, masculine, and positive domains. A new metric, the moral contrast score, enables unsupervised discovery of latent binaries, identifying the cultural dimensions that structure early modern discourse—from affective expression and trust to social refinement and territorial legitimacy. The result is a richly layered moral landscape, where contrasts are not static but overlapping and mobile. Far from mapping fixed binaries, the model reveals a dynamic, high-dimensional field in which language organizes—and reconfigures—cultural values. This method offers a scalable, interpretive framework for investigating the moral geometry of meaning in historical texts.

图：核心词汇语义邻域图（左上：死亡；右上：戏剧；左下：牙买加；右下：信用）

Figure：The semantic neighborhood of death (upper left), play (upper right), jamaica (lower left), and credit (lower right).

复杂网络视角下现代简体汉字构形系统研究

Investigating the structural formation system of modern simplified Chinese characters from a complex network perspective

Wei Huang , Yonghui Xie , Junting Li

摘要：汉字在东方文明发展进程中具有核心地位，其构形体系具备系统性特征，而系统科学方法在汉字构形系统研究中的应用仍较为匮乏。为此，本研究基于复杂网络方法，探究现代简体汉字的宏观网络特征与核心系统属性。研究以汉字基础构件为节点、构件关联为边，构建共现网络、有向网络、加权网络三类网络模型及对应的随机对照网络；通过17项通用网络指标量化分析网络特征，深入阐释汉字构形系统的内在属性。研究结果表明，相较于随机网络，汉字构件网络呈现小世界性、无标度性、异配混合性、高中心性与层级组织性五大显著特征；这五大特征印证了汉字构形系统兼具复杂性、鲁棒性与经济性三大属性，分别源于构件的有序组合、紧密联结与高效配置。本研究成果为汉字构形系统的实证研究提供了重要补充。

Abstract：Chinese characters hold pivotal significance in the development of Eastern civilization. While their structural formation exhibits a systematic nature, system science approaches are seldom seen in analyzing the formation system. Therefore, based on complex network methods, this study aims to examine macro-scale network features and the essential system properties of modern simplified Chinese characters. Specifically, three types of networks were constructed: co-occurrence, directed, and weighted networks, along with their random counterparts. In these networks, primitive components served as nodes, and their relationships as edges. Subsequently, seventeen general network metrics were measured to analyze the network features, enabling deeper discussion of the system properties. Research results show that component networks exhibit five distinct network features compared to their random counterparts, including small-world feature, scale-free feature, disassortative mixing, high centrality, and hierarchical organization. These features demonstrate that the formation system displays three properties, namely complexity, robustness, and economy, which emerge respectively from organized, close-knit, and efficient component combinations. These findings serve as a significant supplement to the empirical research on the structural formation system of Chinese characters.

图：真实网络与随机网络全局拓扑图（a、c、e：真实网络；b、d、f：随机网络）

Figure：Global network graphs of real and random networks (a, c, e: network graphs of real networks; b, d, f: network graphs of random networks).

非遗审美中的情绪唤醒与视觉认知双路径研究——以硖石针刺灯彩画为例

A dual-path approach to emotional arousal and visual cognition in intangible cultural heritage: the case of Xiashi Pinprick Lantern Pictures

Wenwen Shi , Yi-Tong Cui , Yihong Liu , Weicong Li , Xinlong Li , Jing Sun

摘要：当前关于晚期Z世代对中国非物质文化遗产的审美体验与评价的实证研究仍较为匮乏。本研究以清代颜元庄硖石针刺灯彩画为研究对象，探究大学生群体的审美反馈，聚焦作品主题、潜在审美偏好及其与喜爱度的关联。研究采用双路径研究方法：其一，运用混合设计方差分析与语义差异量表，分析四大主题作品，通过因子分析与聚类分析提取审美潜在维度；其二，开展眼动实验，记录注视时长与注视次数，量化分析眼动指标与主观评分的相关性。研究结果表明：第一，园林主题灯彩画的喜爱度评分显著最高（P<0.001），主题与专业的交互效应不显著（P=0.094），说明不同专业群体的审美偏好具有一致性；第二，研究提取出情绪效价、形式审美、认知唤醒三大潜在维度，揭示了非遗审美的多层级体验结构；第三，潜在审美偏好与喜爱度呈极强正相关（r=0.94，P<0.01），而眼动注视指标与喜爱度无显著相关性（P=0.174；P=0.234）。研究证实，晚期Z世代对传统针刺灯彩艺术的偏好具有主题依赖性，由情绪与认知双重参与塑造。本研究结合主客观测量方法提供实证依据，深化了对晚期Z世代非遗审美认知的理解，为非遗传承与美育推广提供理论支撑。

Abstract：There is a limited empirical research on late Generation Z’s (Gen Z’s) experience and evaluation of Chinese intangible cultural heritage (ICH). This study examines university students’ aesthetic responses to Xiashi Pinprick Lantern Pictures by Yan Yuanzhuang (Qing dynasty), focusing on theme and latent preferences and their relation to liking. Using a dual-path approach, this study conducted (1) a mixed-design ANOVA and semantic differential scale to analyze four themes and extract latent dimensions via factor and cluster analysis, and (2) an eye-tracking experiment to record fixation duration and count, thus assess their correlations with subjective ratings. Results revealed: (1) the garden-themed pictures scored significantly higher in liking level (P < .001), with no significant theme–major interaction effect (P = .094), indicating consistent preferences across disciplines; (2) three latent dimensions, Emotional Valence, Form-Aesthetic, and Cognitive Arousal, were extracted, revealing a multi-layered structure of aesthetic experience; (3) latent preference strongly correlated with liking levels (r = 0.94, P < .01), whereas fixation-based metrics showed no significant correlation (P = .174; P = .234). These findings suggest that Gen Z’s preferences for traditional pinprick lantern art are theme-dependent and shaped by emotional and cognitive engagement. The study offers empirical evidence, through both subjective and objective measures, to better understand how ICH artworks are perceived by late Gen Z audiences, contributing to ICH transmission and the promotion of aesthetic education.

图：《黛玉葬花》眼动热力图

Figure： Daiyu Burying the Fallen Flowers eye-tracking heatmap.

从浪漫到现实：数字人文视域下华语流行歌词的词汇与主题演变

From romance to reality: lexical and topic evolution in Chinese popular lyrics through digital humanities approaches

Yubo Wang , Lüyuan Wang , Fang Xie , Haitao Liu

摘要：本研究探究2000-2025年华语流行歌词的词汇与主题演变，映射大众情感变迁与社会文化转型。现有研究多忽视非西方歌词语料库，本研究填补这一空白，构建涵盖25年、1560首代表性歌曲的华语流行音乐历时语料库。研究采用数字人文方法，融合词频分析、可读性指标与BERTopic主题建模，追踪歌词词汇与主题的历时演变。研究发现，歌词高频词汇始终围绕情感表达、个体自省与人际互动三大核心；类符形符比显著提升，表明歌词词汇丰富度持续增长，文本复杂度呈波动变化，反映出创作风格的动态转向。主题分析识别出浪漫爱情、未来期许、都市生活等12大核心主题，主题结构从自然与时间的抽象表达，逐步转向情感具象化与心理内省化书写；近年来，负面情感主题与自我指涉艺术母题的占比显著上升。本研究为音乐、语言与社会交叉领域研究提供了全新方法论框架，彰显了数字人文工具在大规模歌词分析中，描摹集体情感与文化变迁的核心价值。

Abstract：This study explores the lexical and topic evolution of Chinese popular music lyrics from 2000 to 2025, reflecting changing public sentiments and broader socio-cultural transitions. While prior research has largely overlooked non-Western lyric corpora, this study addresses that gap by constructing the Chinese Popular Music Diachronic Corpus, comprising 1,560 representative popular songs sampled across twenty-five years. Using digital humanities approaches, it integrates word frequency analysis, readability metrics, and BERTopic-based topic modelling to trace lexical and topic evolution of lyrics over time. Findings reveal that high-frequency words consistently revolve around emotional expression, individual introspection, and interpersonal dynamics. Type-token ratio (TTR) has increased significantly, indicating growing lexical diversity, while textual complexity shows fluctuation, reflecting stylistic shifts in song writing. Topic analysis identifies twelve major topics, including romantic love, future aspirations, and urban life, with topic structures evolving from natural and temporal abstraction to emotional concreteness and psychological introspection. Recent years show a significant rise in negative emotional topics and self-referential artistic motifs. This study contributes a novel methodological framework for interdisciplinary research at the intersection of music, language, and society, and underscores the value of digital humanities tools in mapping collective emotions and cultural change through large-scale lyric analysis.

图：各项指标历年变化趋势

Figure：Trends of various metrics over the years.

解码人工智能话语：基于机器学习的中德语境媒体表征对比研究（2018-2023）

Decoding AI discourse: contrastive analysis of media representations in German and Chinese contexts (2018–23) using machine learning techniques

Kuanyong Qiu , Noah Bubenhofer , Timo Hackel

摘要：现有研究多聚焦单一语言媒体中的人工智能（AI）话语，缺乏跨文化语境的对比分析。本研究优化露丝・沃达克话语分析框架，结合前沿机器学习方法，分析 2018-2023 年中德两国媒体话语中的人工智能表征。研究发现，中德媒体均聚焦本土相关的人工智能议题：中国媒体频繁援引政治人物言论（尤以习近平主席为核心），采用视角建构策略，对人工智能始终持积极立场；德国媒体则侧重科技人物叙事，尤其在ChatGPT发布后，对人工智能持更为批判与审慎的态度。两国媒体话语的差异，源于本土语境塑造的差异化媒介文化体系：中国媒体隶属于政党体系，将人工智能定位为推动经济发展的国家战略核心，传递政府核心立场；德奥瑞三国媒体则呈现多元视角，高度关注人工智能的潜在风险。本研究为各国人工智能政策的解读与制定提供了重要参考依据。

Abstract：Previous studies have primarily focused on artificial intelligence (AI) discourse within specific language media, with limited contrastive analyses across different cultural contexts. This study analyzes the representation of AI in German and Chinese media discourses from 2018 to 2023, employing a modified version of Ruth Wodak’s discourse analysis framework alongside advanced machine learning methods. Our findings indicate that both German and Chinese media concentrate on AI issues pertinent to their regions. Chinese media adopt a perspective strategy by frequently quoting political figures, particularly President Xi Jinping, and consistently maintain a positive stance on AI. Conversely, German media, especially after the launch of ChatGPT, highlight high-tech figures and adopt a more critical and cautious approach toward AI. These differences in media discourses arise from distinct media cultural systems shaped by their respective contexts. In China, media outlets are party-affiliated and promote a narrative framing AI as a national strategic endeavor crucial for economic growth, reflecting governmental viewpoints. In contrast, media from Germany, Austria, and Switzerland present diverse perspectives on AI, expressing significant concerns about its potential risks. This study offers valuable insights for interpreting and formulating AI policies across different nations.

图： 2018-2023 年中国媒体人工智能语料库十大子主题历时变化

Figure:Top ten subtopics over time 2018–2023 in Chinese media AI-related corpus (Color version of the figure is available in the supplementary data).

文化分析与表征政治：埃及《箴言》杂志中犹太群体形象的呈现研究（1933-1953）

Cultural Analytics and the Politics of Representation: Mapping the Jewish Presence in Egypt’s al-Risālah (1933–1953)

Eid Mohamed , Said Hassan

摘要：本文融合文化分析与后殖民理论，探究1933-1953年埃及主流文学期刊《箴言》中犹太群体的形象表征。研究运用数字文本分析方法，识别并解读期刊档案中犹太形象的建构范式，揭示两次世界大战之间及战后初期，阿拉伯知识分子在政治张力下对犹太身份的协商与阐释。这一跨学科研究方法实现了量化建模与文本细读的结合，精准剖析文化生产机制，呈现这一关键历史节点阿拉伯印刷话语的内在矛盾性。本研究的核心价值在于，对锡安主义兴起与以色列建国这一重塑阿拉伯世界族群关系的关键时期的媒介文本开展数字化研究：随着阿拉伯犹太人大规模撤离埃及，《箴言》中犹太形象的动态演变，折射出社会层面对民族身份、宗教差异与殖民遗产的集体焦虑。全文分为两部分：第一部分将《箴言》置于反殖民与后奥斯曼知识思潮中，阐释其成为阿拉伯现代性多元理念博弈平台的历史背景；第二部分结合远读与阐释性分析，梳理期刊对犹太人的三重身份建构——现代性符号、西方帝国主义受害者、阿拉伯主权威胁者。这些形象建构并非单纯的历史遗存，更深刻影响着当代族群宗教身份与政治归属感的认知。本研究将数字人文工具与知识生产的后殖民批判相结合，为阿拉伯印刷文化、形象表征与媒介史研究提供了全新的方法论与认识论范式。

Abstract：This article investigates the representation of Jews in al-Risālah, a major Egyptian literary magazine (1933–53), by integrating Cultural Analytics and postcolonial theory. Using digital text analysis methods, it identifies and interprets patterns of Jewish representation within al-Risālah’s archive, uncovering how Arab intellectuals negotiated Jewishness amid the political tensions of the interwar and early postwar periods. This interdisciplinary approach enables a nuanced examination of cultural production, bridging quantitative modelling with close textual reading to reveal the ambivalences of Arab print discourse during a pivotal historical moment. The study’s significance lies in its digital examination of media texts published during the rise of Zionism and the establishment of Israel, an era that reshaped intercommunal relations across the Arab world. As Arab Jews began departing Egypt en masse, al-Risālah’s evolving depictions of Jews became symptomatic of broader anxieties over national identity, religious difference, and colonial legacies. Structured in two parts, the first contextualizes al-Risālah within anti-colonial and post-Ottoman intellectual currents, illustrating how the magazine became a platform for competing visions of Arab modernity. The second part synthesizes distant reading with interpretive analysis to assess how al-Risālah conceptualized Jews: as symbols of modernity, victims of Western imperialism, or threats to Arab sovereignty. These portrayals are not merely historical curiosities but inform contemporary understandings of ethno-religious identity and political belonging. By fusing digital humanities tools with a postcolonial critique of knowledge production, this study contributes a new methodological and epistemological model for analysing Arab print culture, representation, and media history.

图：《箴言》杂志核心关键词年度出现频次

Figure.Recurrence of key words in al-Risālah by years.

编译｜洪冰凤

校对｜罗斯鹏

排版｜魏翔

阅读原文

跳转微信打开

2 - 北京大学公众号

文脉新生・数智赋能：“我用 AI 校古籍”（2026年）计划启动仪式圆满举行！

2026年4月17日 22:43

PKUDH 2026-04-17 22:43 北京

2026年4月11日，由全国高等院校古籍整理研究工作委员会主办，山东大学国际汉学研究中心、山东大学古籍文献研究所与字节跳动公益联合承办的2026年 “我用AI校古籍”——我是“校书官”古籍大众智能整理计划启动仪式暨海外汉籍资源汇聚与典籍活化前沿论坛在山东大学中心校区举办。该会议由字节跳动公益团队策划并筹办，聚焦“我用AI校古籍——我是‘校书官’古籍大众智能整理计划”活动的已有成果与经验，正式启动“我用AI校古籍”2026年度相关工作。共有来自北京大学、清华大学、复旦大学、山东大学等高校的学者代表，来自68所高校、76个承办高校的教师，以及抖音平台各类知识博主约100余人参加了本次启动仪式。山东大学副校长曹现强、全国高等院校古籍整理研究工作委员会秘书长卢伟、教育部语言文字应用管理司副司长王晖、抖音集团企业社会责任部产品与运营总经理罗海岳出席启动仪式并致辞。

北京大学数字人文研究中心与字节跳动公益联合打造的“识典古籍”平台，作为全球规模领先的古籍智能化整理与数字化阅读平台，是“我用 AI 校古籍”活动的依托平台。平台深度融合人工智能技术，构建起从古籍OCR技术、自动标点、实体识别到多版本智能校勘的全流程解决方案，彻底打破古籍整理的专业壁垒，让普通大众深度参与中华文脉传承成为现实。

2024年7月，由全国高等院校古籍整理研究工作委员会、北京大学数字人文研究中心、字节跳动公益联合国内多家古籍研究机构共同发起 “我用AI校古籍 —— 我是‘校书官’古籍大众智能整理计划”，依托识典古籍智能整理平台，广泛动员高校学生与社会公众参与古籍数字化整理工作。项目自启动以来，已经历经三期稳步迭代，参与规模与整理深度持续提升。截止2026年4月，已经累计完成2万余部古籍整理，其中粗校16亿字、精校1亿字；吸引4.2万余名参与者，覆盖全国1450余所高校，总曝光超30亿次，先后获人民网、新华社、《科技日报》等权威媒体报道，并通过20余位文化达人实现广泛传播。

2026年度的大众整理活动，将由国内68所高校，76个单位参与承办该项活动，承办高校主要负责动员与组织本校相关专业学生参与本项活动。活动时间为3月26日至11月30日，分春季、暑期、秋季三个阶段开展，重点围绕海外汉籍汇聚、典籍活化传播两大方向，持续以科技赋能、创新表达推动古籍走近大众。

在活动启动仪式上，山东大学副校长曹现强介绍了山东大学在古籍整理、全球汉籍合璧工程等方面的成果，表示山东大学将持续深化AI技术与古籍整理融合，助力中华文脉永续传承。全国高等院校古籍整理研究工作委员会秘书长卢伟回顾了 “我用AI校古籍” 前几期的活动成效，介绍了2026年活动开展的重点方向，并宣布优化招募机制，推动活动专业化、规模化发展。教育部语言文字应用管理司副司长王晖高度评价AI赋能古籍整理的创新价值，结合人工智能与教育行动计划，从守正创新、以文铸魂、数字赋能等方面提出期望，鼓励多方协同打造语言文化传承品牌，让青年成为古籍传承主力军。抖音集团企业社会责任部产品与运营总经理罗海岳分享识典古籍平台功能升级情况，说明2026年将深化与各承办高校的合作、支持学者项目研究、开展征文及创作者共创等古籍活化传播活动，持续以技术降低公众参与门槛，推动古籍实现活态传承。

山东大学副校长曹现强致辞

全国高等院校古籍整理研究工作委员会秘书长卢伟致辞

教育部语言文字应用管理司副司长王晖致辞

抖音集团企业社会责任部产品与运营总经理罗海岳致辞

左右滑动查看更多

“我用AI校古籍”活动开展以来，成千上万社会大众与高校青年积极参与其中。在启动仪式上，年届74岁高龄的常兰藻先生作为大众代表来到现场分享参与“我用AI校古籍”活动的体会，作为一名退休会计，退休后因为学习《资治通鉴》的有关知识，偶然发现识典古籍平台并开始利用AI校对古籍，4个月时间已经校对完19卷、近20万字古籍。看见自己名字出现在上架古籍中，老爷子感到无比骄傲。年仅21岁的山东大学大四学生王思彤则代表青年学子发声，分享自己立足在校参与项目、借助平台参与古籍整理的实践体验，分享了利用AI技术辅助古籍整理体会到的高效便捷与参与活动得到的成长锻炼。

优秀志愿者常兰藻先生发言

优秀志愿者山东大学王思彤同学发言

左右滑动查看更多

在论坛的主题报告环节，专家学者们围绕古籍整理、数字赋能、大众参与以及学科发展等多个维度分享前沿实践与深入思考。“我用AI校古籍”活动所整理的一部分重要的古籍资源，来自哈佛燕京图书馆等海外图书馆。北京大学中文系杨海峥教授介绍了如何在海外流传古籍馆藏目录基础上，利用人工智能技术，通过海量元数据提取与流传轨迹重建，构建起大规模古籍流传数据集的过程。

新时代大型古籍整理项目北大《儒藏》工程的全本编纂正在利用“识典古籍”整理平台展开编纂工作，2025年度已经在平台上线约50种经过精心校勘整理的古籍资源。北京大学《儒藏》编纂与研究中心李畅然教授分享《儒藏》数字化与识典古籍平台的实践经验，提出提升OCR技术准确率、完善标点符号等具体技术需求，希望平台功能更加贴合专业古籍整理的需求。

北京大学数字人文中心副主任杨浩副研究员指出历史上古籍整理多由官方与学术精英主导，存在专业门槛高、效率有限、难以普及的局限，以 “我用AI校古籍” 为代表的大众参与模式，让广大青年学子与普罗大众得以加入，既提升了整理效率，也实现了沉浸式的文化传承。活动通过多层级校勘体系兼顾普及性与学术严谨性，既保障了古籍整理质量、培育后备人才，也致力于构建开放共享的数字人文生态。

全国高校古籍整理委员会副秘书长吴国武教授肯定“我用AI校古籍”活动在古籍整理发展历程中具有重大历史价值，体现出AI 技术深度赋能、大众志愿广泛参与、古籍整理范式持续革新的新形态，并指出古籍传承展现出从精英小众走向大众普惠、走向国际传播的新趋势。

北京大学中文系古典文献教研室主任杨海峥教授主题报告

《古籍流传的大规模数据集构建》

北京大学《儒藏》编纂与研究中心李畅然教授主题报告

《两条腿走路——OCR与语义两模块的协同》

河南大学出版社总编辑、河南大学文学院孔令刚教授主题报告

《古籍数字化的双向赋能与协同共生》

抖音集团社会责任部古籍项目经理张明月主题报告

《科技助力古籍保护与传承》

北京大学数字人文中心副主任杨浩主题报告

《众手续文脉：智能时代古籍大众整理的价值与路径》

山东大学国际汉学研究中心主任刘心明教授主题报告

《文本化是古籍整理的大方向》

山东大学古典研究所所长王承略教授主题报告

《中国文化典籍的家底与古籍整理定本的意义》

全国高等院校古籍整理研究工作委员会副秘书长田国武教授总结报告

左右滑动查看更多

论坛的下午场，到场学者们分享借助识典古籍平台与已有的大众整理成果，展开的学术研究与资源应用。例如：复旦大学文史研究院段志强副研究员，基于平台上整理的古籍资源，深入解读古籍中所反映的古代基层社会与普通人生活；清华大学人文学院严程副教授，利用识典古籍平台建设专题文献库“女子艺文数据库”，开展女性文献等专项研究，分享了清代多位具有独特性格的女子的生动故事；清华大学人文学院唐宸副教授分享利用数字人文方法对李白与李璘谋反事件关系的新研究，特别展示了如何利用识典古籍平台深度研究助手补上部分关键史料的例证。学者们以文献解读、案例实证、数字化工具应用等形式，系统呈现研究进展与实践心得，既具备扎实史料价值，又为新时代古籍活化利用提供了现实案例。

复旦大学文史研究院副研究员段志强

《在古籍中发现普通人的生活》

清华大学人文学院副教授严程

《盘一盘自带爽感的清穿女》

北京体育大学人文学院副教授黄二宁

《跃然纸上——唐代文献中的体育世界与大众传播的可能性》

清华大学人文学院副教授唐宸

《数字化时代的古典文献挖掘——以字节跳动“识点古籍”为例》

山东大学全球汉籍合璧工程首席专家助理陈肖杉

《形制与传播——境外汉籍流播管窥》

左右滑动查看更多

在古籍活化方面，创作者们开始频繁利用平台整理成果为根基进行创作传播。例如：抖音专攻五代历史的知识博主杨利辉，在创作过程中经常利用识典古籍平台的资源与工具展开创作，原本他制作有多种检索小工具方便创作，在利用识典古籍平台之后，自用软件多已不再更新。自媒体博主Asu、艾斯利用平台的古籍资源中挖掘 “十二花神” 等文化意象进行年轻化表达，让严谨的古籍整理成果走向大众。创作者们从古籍创作方法论、知识自媒体运营、二次元年轻化表达、小众历史趣味科普等多个维度展开分享，集中展现了前期大众参与古籍整理所形成的高质量成果，为古籍活化创作提供了坚实内容支撑。论坛的最后，创作者们还围绕古籍如何实现通俗化表达、年轻化传播与跨圈层破圈进行深入交流与讨论，认为古籍活化必须坚守学术严谨，同时力求通俗易懂而不失文化底色。

创作者子非秋月

《裁取云间墨，化得千亿身——浅谈短视频创作对古籍素材的选取与运用》

文化搬运委员会

《学术叙事的生活化转向：科普自媒体的叙事策略与实践创新》

Asu、艾斯

《谁说古籍只能藏在书斋？我们把“十二花神”变成了全网爆款》

杨利辉

《一个小众赛道的自媒体博主独白：从我和<太平年>的故事说起》

左右滑动查看更多

整场论坛展现出从大众整理产出成果，到学者利用整理成果做研究，再到研究成果反哺平台与传播的一条完整闭环。可以说，从顶层支持到学术论证，再到成果应用与大众参与，古籍整理仅靠专业学者还不够，只有借助AI技术与大众力量，才能真正破解海量文献古籍数字化难题。正是基于这样的共识与实践基础，“我用 AI 校古籍——我是‘校书官’古籍大众智能整理计划” 才得以持续推进、不断升级，成为连接学界、平台与社会大众的重要文化工程。

回望“我用AI校古籍” 活动的发展历程，从理念萌发到模式探索，其源头实际上是出于一场巨大的现实挑战。2022年3月，北京大学数字人文中心与字节跳动公益合作，开始建设“识典古籍”阅读与整理平台，旨在人工智能与计算机技术的辅助下，变革古籍整理方式，在三年内完成万种古籍的整理工作。2022年10月，识典古籍阅读平台正式上线对外发布，但资源建设是亟须解决的瓶颈难题。2023年11月，北京大学数字人文中心接受哈佛大学燕京图书馆捐赠的近万种中国善本古籍数字化书影。根据捐赠协议，这批古籍扫描书影连同所识别点校后文字，需要全部发布在古籍数字化平台上。然而，利用OCR技术对古籍图像识别之后，还必须经过人工校对才能达到更高的准确率，才能更好地被学界与大众利用，然而专业整理力量有限，人力成本奇高。因此，北大数字人文研究中心王军教授提出，必须借助人工智能技术降低参与门槛，以众包模式发动社会大众参与古籍整理，用普通大众力量破解古籍数字化难题。

传统古籍整理依赖少数具备深厚学养的专家学者，需要丰富的专业知识，专业门槛很高。大众是否能够参与古籍整理这样高度专业的工作，得到很多专家的担忧。因此，在项目正式启动前，北京大学数字人文研究中心召开多次小范围的专家论证会。2024年6月，国家古籍保护中心、北京大学数字人文研究中心、抖音公益等多家机构联合举办了“智能时代古籍整理与大众传播”研讨会。此次研讨会汇集了来自国内各大高校、图书馆和出版社的七十余位专家学者。与会学者就智能信息环境下古籍整理与大众传播的机遇与挑战进行了深入交流。活动还得到“到梦空间”第二课堂平台的支持，符合条件的同学可获得实信网认证的电子版实践证书，成为活动成功举办的重要激励机制之一。2024年7月，在古委会主办下，在各方的大力支持下，“我是校书官古籍大众智能整理计划” 第一期正式启动。

为了让这一构想切实落地见效，为项目开展筑牢基石，北京大学数字人文研究中心联合字节跳动公益推出《古籍智能通识课》《识典古籍智能整理培训课》等系列课程，课程内容涵盖古籍书史、海外汉籍、古籍整理以及历史上的大型丛书等内容，还结合识典古籍智能整理平台，讲解从OCR文字识别到文字精校、结构整理、标点校对等多个环节的技术和方法，将文献学知识转化为实践体系，将人才培养与古籍整理实践相结合。

2024年7月活动首期试行，面向大学生开展OCR校对工作，共有742所高校、超2500名学生参与，取得远超预期的阶段性成果。2024年9-12月第二期同时开设大众组、进阶组任务，参与人数突破1万人，覆盖高校增至1210所，整理规模与参与广度大幅提升，并尝试探索古籍数字化与专业教学、人才培养相结合的新模式。2025年第三期在前期经验基础上进一步拓展范围与深度，联动含清华大学、武汉大学、中国人民大学、中山大学、四川大学等22所承办高校以及1450余所参与高校，构建起 “主办单位统筹+承办高校执行+社会力量参与” 的协同机制。

可以说，“我用AI校古籍”活动不仅是一次古籍整理的技术创新，更是一场面向全民的文化传承实践。项目正逐步构建开放、共享、协作的古籍整理新生态。未来将进一步带动全国高校学生与社会公众共同参与古籍数字化整理，推动中华典籍实现全民传承、活态传承，让古籍真正融入大众的日常生活与精神文化生活，助力中华优秀传统文化创造性转化与创新性发展。

阅读原文

跳转微信打开

2 - DHQ（Digital Humanities Quarterly）

Reconfiguring the Ideal Order: Ablation and Field Formation in the Twentieth-Century Nigerian Novel in English

作者Grant Hamilton, Chinese University of Hong Kong · Eli Park Sorensen, Chinese University of Hong Kong

2026年4月18日 08:00

What if the most structurally important books to a literary tradition aren't its most famous? This paper introduces a new computational method by which the critic can determine the texts that are foundational to the semantic architecture of a corpus.

2 - DHQ（Digital Humanities Quarterly）

Infrastructures of Listening: The ManoWhisper Podcast Analysis Pipeline

作者Nick Ruest, York University, Digital Scholarship Infrastructure · Brianna I. Wiens, University of Waterloo, English Language & Literature · Karmvir K. Padda, University of Waterloo, Sociology and Legal Studies · Shana MacDonald, University of Waterloo, Communication Arts

2026年4月18日 08:00

ManoWhisper is an end-to-end research pipeline for collecting, transcribing, and analyzing hateful and misogynistic podcast content, built to support peer-reviewed and policy-facing research on gender-based extremism. This paper argues the tool reframes harmful media as a site of feminist methodological inquiry, with implications for understanding how such content spreads across platforms and into AI training data

2 - DHQ（Digital Humanities Quarterly）

The Eras Tour: Machine Learning for Dating Historical Texts from Greco-Roman Egypt

作者Danushka Bandara, Fairfield University · Adrian Gallant, Fairfield University · Habibul Huq, Fairfield University · Fatima Chowdhury, Fairfield University · Jack Stow, Fairfield University · Giovanni Ruffini, Fairfield University

2026年4月18日 08:00

This paper introduces a machine learning approach to predicting the authorship dates of historical texts by using named entities — specifically, person and place names — as temporal markers.

13 - 韩国数字人文协会（KADH）

[채용] 전임 / 철학+창작 / 숭실대 2026-2학기

作者Baro

2026年4月17日 22:12

https://scatch.ssu.ac.kr/%EA%B3%B5%EC%A7%80%EC%82%AC%ED%95%AD/?slug=2026-2%ed%95%99%ea%b8%b0-%ec%88%ad%ec%8b%a4%eb%8c%80%ed%95%99%ea%b5%90-%ea%b5%90%ec%88%98-%ec%b4%88%eb%b9%99-%ea%b3%b5%ea%b3%a0-%ec%a0%95%eb%85%84%ec%a7%81-%eb%b0%8f-%eb%b9%84%ec%a0%95%eb%85%84

인터넷 접수기간 : 2026.4.17.(금) 15:00 ~ 5.4.(월) 15:00

철학과	독일근현대철학	Modern and Contemporary German Philosophy	1	1. AI연구, 강의 가능
예술창작학부문예창작전공	시 창작 및AI 생성문학	Poetry Writing & AI-Generated Literature	1	1. 시 전문지 편집위원 2. 국내 저명 시 문학상 수상자 3. 최근 5년간 ‘AI와 문학’ 관련 연구업적 (저서, KCI등재학술지 포함) 1편 이상

철학과	독일근현대철학	1. 최근 5년간 한국연구재단 등재 학술지 4편 이상 게재 (A&HCI 주저자 논문 및 단독저서는 KCI등재 학술지 논문 2편으로 인정, 단독 번역서는 KCI 등재 학술지 1편으로 인정함)
예술창작학부문예창작전공	시 창작 및AI 생성문학	1. 석사 학위 이상 소지자 2. 최근 10년간 시집 3권 (시론 관련 저서 포함) 이상 출간

게시물 [채용] 전임 / 철학+창작 / 숭실대 2026-2학기이 KADH / 한국디지털인문학협의회에 처음 등장했습니다.

12 - 南京师大比特人文公众号

《人工智能语料库标注术语》参编单位征集

2026年4月17日 07:30

徐惠 2026-04-17 07:30 江苏

教育部语用所启动《人工智能成都话拼音标注规范》研制，征集参编单位与专家，5月30日前报名。

转载自“语标智研”

为深入推进人工智能领域语言文字规范标准建设，服务大模型时代高质量语料数据治理需求，受有关司局委托，教育部语言文字应用研究所开展《人工智能语料库标注术语》标准研制工作。

本规范聚焦语料库建设与标注实践中的核心术语需求，梳理全流程概念体系，明确术语定义、分类框架及使用规则。成果可服务于数据处理与应用等环节，为减少术语歧义、提升应用一致性提供支撑。

现面向全国高校、科研院所、人工智能企业、数据标注服务机构及重点行业单位，诚邀相关领域具有研究基础与实践经验的单位和个人积极参与，共同推进术语标准的研制与产业落地应用。

请有意参与编制的单位及专家于2026年5月30日前扫描下方二维码或复制链接（https://f.wps.cn/ksform/w/write/383ZQL8p #routePromt）进行报名。

比特人文

投稿邮箱：dhbase@126.com

扫码关注获取更多资讯

阅读原文

跳转微信打开

3 - 人民大学公众号

《全球视野下的中国古籍数智化演进与未来趋势报告》全文

2026年4月17日 14:06

原创数字人文研究 2026-04-17 14:06 北京

中国人民大学数字人文研究院2026年4月发布

点击下载全文：全球视野下的中国古籍数智化演进与未来趋势报告.pdf

排版：覃子椿

初审：徐碧姗

复审：段婧怡

终审：夏翠娟

阅读原文

跳转微信打开

3 - 人民大学公众号

段玉裁《说文解字注》知识库的构建与应用

2026年4月16日 19:30

原创沈小妮等 2026-04-16 19:30 北京

段玉裁；《说文解字注》；《说文解字读》；知识库；知识表示

转载请注明“刊载于《数字人文研究》2025年第4期”;参考文献格式:沈小妮,彭炜明,胡佳佳.段玉裁《说文解字注》知识库的构建与应用[J].数字人文研究,2025,5(4):68-83.全文PDF已在知网、万方及编辑部网站(http://dhr.ruc.edu.cn)上发表，此处注释及参考文献从略。

段玉裁《说文解字注》知识库的构建与应用

沈小妮彭炜明胡佳佳

摘要：段玉裁《说文解字注》是“说文学”研究的巅峰之作，集中体现了乾嘉学派的学术成就。当前有关《说文解字注》的数字化工作多停留于文本化阶段，尚未实现对其知识体系的深度挖掘与系统呈现。研究以《说文解字注》及其稿本《说文解字读》为底本构建知识库，设计了涵盖五大知识范畴、十个知识集、五十六个知识点的三层分类体系，完成了数万条知识实例的标注与结构化表示；在此基础上，开发了具备原文检索、知识导览与标注管理功能的交互式平台；并以对段玉裁的谐声归部研究为例，验证了知识库的应用潜力。这是数字人文方法在传统语言文字学领域的一次系统实践，为古籍深度数字化与人文研究范式创新提供了重要参考。

关键词：段玉裁；《说文解字注》；《说文解字读》；知识库；知识表示

作者简介：沈小妮，北京师范大学第二附属中学语文教师；彭炜明，北京师范大学汉字汉语研究与社会应用实验室研究人员；胡佳佳（通讯作者），北京师范大学文学院副教授，hjj81@126.com。

引言

2022年4月，中共中央办公厅、国务院办公厅印发《关于推进新时代古籍工作的意见》，再次强调推进古籍的数字化工作，“积极开展古籍文本结构化、知识体系化、利用智能化的研究和实践，加速推动古籍整理利用转型升级”。在我国古典目录学的著录中，有一类被系统归为“小学”类的特殊文献，即中国传统的语言文字学著作。“中国的传统语言学因为负载着三千年的文化信息，包涵着中华民族的世界观、价值观、民族观，所以，它不仅是一种学术资源，而且是一种知识资源；不仅是一种知识资源，而且是一种人文资源，一种道德资源。” 《说文解字》（以下称《说文》）是中国传统语言文字学最重要的著作。黄侃谓治“小学”须读十部书，而“《说文》一书，于小学实主中之主也”。段玉裁的《说文解字注》（以下称《段注》）是《说文》研究著作中最重要的一本，也是中国传统语言文字研究巅峰——乾嘉学术的代表。段氏的卓越成就，“不仅在于他‘究其微恉，通其大例’，对许书做了细密全面的校勘整理，更在于他通过对许书的注释，提出并初步解决了一系列有关汉语音韵学、文字学、词汇学、训诂学的重大问题，他能初步运用历史发展的观点和一些科学的方法来研究语言现象。换言之，他使《说文解字》的研究，从纯粹校订、考证的旧框子里解放出来，在某种意义上走上了科学语言学的轨道”。

在建设中国特色哲学社会科学学术体系的今天，系统梳理和总结以《段注》为代表的中国传统语言学研究高峰的学术思想与方法论，对总结和升华中国特色的语言文字学原创理论，具有特别重要的意义。从清代到当代，对《段注》的研究一直持续不断，但进入21世纪之后，研究成果却并没有极大丰富，对《段注》挖掘的广度与深度也不够充分。《段注》体大思精，其深刻的思想与科学的方法是散布在丰富的注释中的。《段注》知识库的建设，旨在将这些散布的知识与前人对《段注》的研究结合起来，通过多维联系来展现和揭示其中蕴含的科学内涵，辅助研究者从中提炼出具有现代价值的理论范式，实现对传统学术资源的创造性转化；为构建具有中国特色的理论体系提供重要的数字化研究资源和技术支撑。

目前对于《段注》的数字化工作大多仍处于影像化与文本化的阶段。如国家图书馆建设的中国古籍资源库（http://read.nlc.cn/thematDataSearch/toGujiIndex）里汇聚了多个古籍版本的《段注》高清数字影像；北京时代瀚堂科技有限公司的瀚堂典藏库（https://www.hytung.cn/）提供了经韵楼本《段注》的数字化文本和影像，可以进行字头与全文的字符串检索。北京师范大学的《说文解字》研究与应用平台（https//szsw.bnu.edu.cn），则在影像化和文本化的基础上，对《说文》小篆的形音义知识点进行了提取和结构化存储（如部首、义符、声符……）。我们把这种将知识点单独提取后再进行结构化存储的数据库称作知识库。与之前仅包含了影像、文本、著述信息与目录的古籍数据库不同，知识库是对古籍数字资源的更深层次和更细粒度的加工，它包含了根据前人研究，从原典中抽取、提炼出的具有意义的独立的知识成果，并按照一定的逻辑体系进行关联和组织，从而实现对原典内容的理解、重组和再造。如图 1所示，由于此平台提取和存储了《说文》关于每个小篆构形直接构件的知识点，用户就可以直接搜索到《说文》中以“七”为直接构件的所有小篆，这实际上就是平台为用户提供了关于小篆构形的知识服务。该平台同时提供了上海古籍出版社影印的经韵楼本《段注》的数字影像与文本，也涉及到《段注》个别知识点的提取，如为每个字头标注了段玉裁的古音归部。

图1 北师大《说文解字》研究与应用平台应用示例

知识库的构建是在平台已有《段注》文本与个别知识点的基础上，聚焦《段注》全体系知识点的系统提取与深度标注，最终构建的《段注》知识库具备双重应用价值——既可独立开发为专用应用平台（详见下文），也能作为《说文》研究的资源，通过对应字头链接至该平台的相关说解部分，为使用者提供更为精准的知识服务。

图2呈现了构建《段注》知识库与开发应用平台的实施路线图。本文第1、第2部分将围绕此路线图展开具体论述，第3部分则结合一个基于该知识库及平台的研究实例，进一步说明其应用价值。

图2《段注》知识库构件与应用平台开发实施路线图

《段注》知识库的构建

1.1 文本的数字化与结构化

知识库的构建首先还是需要文本的数字化。文本数字化的第一步则是选择合适的底本。段玉裁注《说文》，经历了长达近四十年的准备、撰写和修订，终于嘉庆二十年（1815）全部刊刻完成，是为经韵楼刻本。“除《清经解》所收的少数本子是重排本外，其余都是经韵楼本的复刻本，不但内容相同，每页的行数、起迄字也都相同。” 段氏自己曾在《段注》中提及，著此书前，先有长编《说文解字读》（以下称《说文读》），始为五百四十卷，既乃檃括成《注》。今北京图书馆藏有《说文读》抄本残卷七册（一至六，八），经朱小健、张和生整理，于1995年由北京师范大学出版社影印刊行[7]。尽管残缺，但它对于研究段玉裁的学术思想、《段注》的形成过程以及清代小学方法都具有极高的价值。

因此，用于知识库构建的基础文本有两种：一是上海古籍出版社1988年影印经韵楼本《段注》，二是北京师范大学出版社1995年出版的《说文读》。对于数字化后的文本，则采用关系数据库技术进行结构化存储。基于《段注》的特点，分卷、部首、字头三层设计数字化文本结构。卷表（Volume）（表1）关注《段注》的卷次结构。部首表（Radical）（表2）聚焦于《说文》部首，全书总计540个部首，每个部首分布于特定卷中，并统领若干字头。字头表是《段注》的核心内容，主要包括字头、许氏原文、段氏注文三大部分。许氏原文、段氏注文存储在字段“zhengwen_zhushi”中，用<pn></pn>标注许氏原文，用<zn></zn>标注段氏注文。由于《说文》字头有正篆（位于每条说解之首的字形）和重文（位于每条说解之末的字形）之分，而《段注》亦对书中的重文进行了注释，所以字头表分正篆表（表3）和重文表（表4），两张表的字段类型基本相同。但重文表中包含对应正篆字段，以标示正篆与其重文的对应关系。

《说文读》与《段注》体例不同，是札记性质的，属于段玉裁长期研读《说文》心得的汇总。它不是依《说文》逐字做注，故多有未注之字，也有重出之字。行文格式上，低一格抄录许书正篆及说解全文，也有仅出重文；字头皆以楷书录之，但也存有篆书的情况，另起顶格作注；论及一字古音尚未使用古韵十七部，而是标以古韵韵目，但也有例外。《说文读》的结构亦分卷、部首、字头三个层次。卷表（du_volume）和部首表(du_radical)字段保持不变。字头表是《说文读》的核心内容，主要包括字头、许氏原文、段氏注文三大部分。许氏原文、段氏注文存储在字段“zhengwen_zhushi”中，用<pn></pn>标注许氏原文，用<zn></zn>标注段氏注文。

两书字头之间的对应关系，通过《读》《注》关系（duzhu_relation）表（表5）的两个外键来实现。其中，“duanzhu_id”实现两书字头的对应关系，以便从《说文读》直接链接到《段注》的相应解释，进行文本比较。

1.2 《段注》知识体系的建构

不同领域的知识要素和结构各不相同。《段注》自成体系，在注释《说文》时采用了很多的术语与条例，如“浑言析言”“古今字”等，集中体现段玉裁的学术思想和理论，也是本研究要提取的“知识点”。具体操作上，我们选取《段注》研究专著中形成共识的术语条例，以《传统语言学辞典》中关于段玉裁的术语为补充，选择实例数超过十条的知识点，并按照传统语言文字学的研究框架，构建了含文字、训诂、音韵、校勘和引书五大知识范畴、十个知识集、五十六个知识点的三级分类知识体系（见表6）。

需要说明的是,本研究将《段注》知识点分为五大范畴,主要出于系统整理的需要,并非意味着知识点之间存在不可逾越的界限,例如“双声”“叠韵”属于训诂学中的声训术语,但是也体现了音韵学研究的内容;“音义同”包含训诂和音韵两个角度的内容。范畴之间的交叉渗透,也体现了段氏综合形、音、义注解《说文》的特点。

1.3 《段注》知识点实例的提取

创建好知识分类体系，就可以逐个开展各知识点实例的提取工作了。《段注》中的知识点实例一般都存在形式化的术语，譬如，段氏谈论“引申”的术语有：“引申（引伸）、因、故、因之、因以为、故以为、引申（伸）假（叚）借、引申......凡、义之相因” 。这就便于采取正则表达式进行匹配提取后再进行人工校对与补充、完善（详见图 2知识提取部分）。《段注》中一个字头下的文本中可能蕴含多个知识点实例，而一类知识点可能在多个字头下文本中都有体现，如图 3所示。甚至在一个字头下的注释中包含了另一个字头的知识点。如：

“艸，百卉也。”卉下曰：“艸之总名也。”是谓转注。二屮、三屮，一也。引伸为艸稿、艸具之艸。“从二屮。”仓老切。古音在三部。俗以草为艸。乃别以皁为草。凡艸之属皆从艸。（《段注·艸部·艸》）

段氏在“艸”字下的注释就涉及转注、引申、古音、俗字等多个知识点，其中俗字知识点实例，还涉及到字头“草”。这就要求合理设计知识库的存储结构。

图3 《段注》文本与知识点及其实例的关系

1.4 《段注》知识库的结构

在知识库的实际构建中，我们采用关系数据库来存储知识体系与知识点实例。表6中的知识点范畴、知识集和知识点可以看作一套有层级的分类标签，这些标签间的层级关系就存储在知识体系表（Knowledge）中（表7）。“parent_id”是标识知识点层级结构的关键字段，通过外键约束机制与该表的“tag_id”字段形成自反引用关系，从而可以表示树状的分类模型。该字段在层级结构中的具体表现为：

第一层级，知识范畴（如“文字类”）的“parent_id”设为 NULL，作为根节点存在；

第二层级，知识集（如“六书”）通过“parent_id”指向其所属顶级分类的“tag_id”；

第三层级，知识点（如“象形”）则通过“parent_id”关联对应的二级分类“tag_id”。

除根节点外，每个知识节点必须有一个唯一的父节点（“parent_id”），形成“知识范畴→知识集→知识点”的严格层级架构。该设计方便数据更新和管理，当删除父标签时数据库会自动删除所有子标签；更新父标签ID时，相关引用自动更新。

在知识库中，每个知识点都有一张对应的实例表，存储所提取的该知识点的实例。根据实例所关联的对象类型，我们将知识点分为两类，不同类型的知识点实例在知识库中也采用不同的结构表示。

（1）单对象描述型知识点，内容仅涉及段玉裁对单个对象的特征、属性、状态的具体描述。“韵部”“构意”等均为此类。如“韵部”知识点实例“莹”的韵部是“十一部”，“构意”知识点实例“”的“构意”描述为“有物在一之下也”。对于此类知识点，实例表结构如表8所示。

（2）多对象关系型知识点，内容涉及对两个或多个对象（字头）间关系的描述，“音义同”“行废字”“古今字”“之言”等属此类。如段氏在“枼”字下注“叶与世音义俱相通”，表述了“叶”“世”两个词的“音近义通”的关系，属于“音义同”知识点实例；在“丕”字下注“故古多用不为丕”，沟通了“不”和“丕”的古今使用关系，属于“古今字”知识点实例。对于此类知识点，除了表8中的三个核心字段外，还增设了“对象1”（duixiang1）和“对象2”（duixiang2）两个字段，来表示对象间的关系（三个以上的对象关系则拆解为两两关系），如表9所示。

《段注》知识库应用平台的开发

《段注》知识库建设的目标在于促进学术资源的共享，以数字化技术赋能传统学术研究。为此，我们开发了一个交互式的《段注》知识库应用平台，提供三大功能模块：（1）原文阅读与检索，（2）知识导览与检索，（3）知识标注与管理。

2.1 原文阅读与检索

“原文阅读与检索”模块提供了两类检索：“字头检索”与“全文检索”。“字头检索”允许用户通过楷字搜索《段注》和《说文读》中的正篆或重文字头，检索结果是匹配到的字头及字头下的许氏说解（大字显示）和段氏注释（小字显示），见图4。“全文检索”允许用户在包括字头、说解和注释的全部文本内容中搜索字符或字符组合，检索结果是包含匹配内容的字头及字头下的说解和注释，并对匹配内容进行了高亮显示，见图5。点击检索结果中的字头，会跳转到该字头的原文阅读页面，见图6。

图 4 字头检索结果页面

原文阅读页面采取三栏式布局（图6）。页面左侧是目录区域，遵循原书结构，以“卷—部—字”为三级树形导航结构，使用户能够快速定位跳转至目标字头。中间的文本展示区，包含字头、字形、《段注》、《说文读》等信息，并提供高清原图链接。右侧的知识索引区，分为“文字”“音韵”“训诂”“校勘”“引书”五大类，呈现《段注》中该字头下的知识点实例。页面顶部也提供了便捷的检索框，输入内容后，直接重新搜索。

图 5 全文检索结果页面

图 6 原文阅读页面

2.2 知识导览与检索

“知识导览与检索”模块页面下方是整理的《段注》三层分类知识体系。段玉裁在注释《说文》时，可能有不同的用语来描述相同的语言现象；我们则在每组表达相同语言现象的用语中选择一个代表形式作为知识点的名称，而将其他用语作为其同义关联项。页面上方搜索框，可以帮助用户通过模糊的用语关联到对应的知识点。如，在搜索框键入“因”，用语中可能包含“因”的知识点（包括其解释和关联用语）会在页面底部出现（图7）。点击搜索结果，可以进入知识导览页面，并定位到该知识点（图8）。

图 7 知识检索结果页面示例

图8 知识导览页面示例(单对象描述性知识点)

知识导览页面同样采用三栏式布局（图8）。左侧导航区采用“知识范畴—知识集—知识点”三级目录结构，通过渐进式交互，帮助用户定位知识点。中间区域，呈现该知识点的具体实例。实例的呈现是以字头为单位的，内容包括字头下的许氏说解（大字显示）和段氏注释（小字显示），注释中与知识点匹配的内容以红色突出显示。对于多对象关系型知识点，还会呈现与之相关联的字头信息（图9）。右侧是知识点说明，提供概念说明以及相关研究文献的索引。

图9 知识导览页面示例(多对象关系性知识点)

说明：此处，页面内容显示的是“音义同”知识点。段玉裁在“皋”字下注释“葢古告皋嗥號四字音義皆同。”在知识点实例中，就存储了“告—皋”“告—嗥”“告—號”三组对象关系型实例。因此，在“告”字下，还显示与其“音义同”的“皋”“嗥”“號”三字。

2.2 知识标注与管理

“知识管理与标注”提供了三个功能：知识点的修改、知识点的增加和知识点实例的标注（图10）。点击“管理知识点”按钮，进入知识点修改功能，选择待修改的知识点，弹出对话框，可以修改该知识点所属的范畴（每一个知识范畴对应的知识集保持不变）和实例类型（图11）。点击“新增知识点”按钮，弹出对话框，可以添加新增知识点的各种属性（图12）。点击“开始标注”按钮，进入知识点实例标注功能。

图10 知识管理与标注页面

图11 修改知识点功能

图12 新增知识点功能

知识标注页面也是三栏式布局。左侧顶部检索框支持关键词或正则表达式检索，可以搜索注文中包含特定用语的字头。点击字头，中间栏显示该字头下的原文。对于单对象描述型知识点，可以选择相应的文字复制到右侧标注工作区的知识描述框，保存标注后，添加为该知识点的一个新增实例（图13）；对于多对象关系型知识点，还增加了对象组管理框，用以添加符合关系的对象组（图14），对象组是两两添加的，如果遇到三个或三个以上的对象关系，可以通过“新增对象组”利用将其分解为两两关系进行添加。

图13 知识标注页面(单对象描述型知识点)

图14 知识标注页面(多对象关系型知识点)

《段注》知识库应用举例

基于构建的《段注》知识库与应用平台，通过知识点间的关联与比照，可以系统展现段玉裁的研究成果，深入挖掘蕴含的学术思想。

以段玉裁的谐声归部为例。在完成了《段注》字头古韵归部与形声字声符的知识点标注后，我们将《段注》中所有形声字按其声符归入其所属的谐声系中，每一谐声系以“声首”为代表。如“照”的声符是“昭”；“昭”的声符是“召”；“召”的声符是“刀”；“刀”是一个非形声字，没有声符了，它就是“召”“昭”“照”的声首。以“刀”为直接声符（如“召”）或间接声符（如“昭”“照”）的所有形声字就构成了一个以“刀”为“声首”的谐声系，这个谐声系是有层次的。将“声首”按所属的古音韵部（即段玉裁的第一至十七部）进行排列，每个声首下按其声符层次展现谐声字，并在与“声首”不同韵部的谐声字后以括号标注其所属韵部（没有标注的谐声字与“声首”的韵部相同），这样就可以一览每个谐声系中的古韵韵部分布，如图15所示。

图 15 《段注》谐声系中韵部分布示例

说明：在《六书音韵表·古十七部谐声表》中，段玉裁将“求”放在第三部，以“求”为声符的形声字都被段玉裁归入第三部，但“求”字是被归入第一部的。此处根据“声首”韵部排列谐声系中，将其排在第一部中。

通过谐声系内韵部间的合韵次数计算,可以绘制出《段注》谐声偏旁分部互用的合韵热力矩阵（见图16）。段玉裁合韵说解释的是“音值相近的韵部，在古人用韵、文字使用、语词派生等方面会经常发生相通”的情况。此处计算的谐声系内韵部间的合韵次数是指谐声字与其声所属韵部不同的情况，如“弭”在第一部，以“弭”为声符的谐声字“麛”在十六部，则第一部与第十六部算作合韵一次。图16 矩阵中的数值就表示对应行、列所代表的韵部在谐声网络中发生合韵的次数，是一个沿主对角线对称的矩阵，次数越多，颜色越深。

段玉裁是第一个打破《广韵》次第，按古音远近关系排列韵部次序的。两个韵部之间的合韵的次数越多，则古音就可能就越相近，越应该相邻排列。反映在图16中，合韵次数多的颜色较深的区域就主要集中在主对角线的两侧，可见段氏古音框架构建的科学性。其中比较特殊的是第一部与第二部，第五部和第六部，第八部与第九部，虽然相邻，但几乎不发生合韵。段玉裁在《古十七部分合用类分表》中，将十七部分为六类（第一部为第一类；第二部至第五部为第二类，第六部至第八部为第三类，第九部至第十一部为第四类，第十二部至第十四部为第五类，第十五部至第十七部为第六类），各大类的区分主要以同韵尾为标准。其中第一和第二部第五和第六部，第八和第九部就是前四类的分界。合韵次数最多的是第十五和第十六部，第十二和第十三部，第十四部和十五部，第七和第八部。

段氏的十五部和十六部就是被钱大昕赞誉为“凿破混沌”的“支之脂三分”中的“脂”部和“支”部，一部则为“之”部。戴震对段氏“支脂之三分”的发现也是赞不绝口，认为“此说为确论”，但同时认为“支”“脂”“之”三部应该放在一起，不应“之”部列第一，而“脂”“支”远在第十五部和第十六部。从图16数据来看，第一部和第十五部、第十六部合韵次数均较少。第十二部和十三部则对应段氏古韵分部的另一贡献“真文分部”中的“真”部和“文”部。江有诰评价“真文分部”时说道：“段氏之分真、文，人皆疑之，有诰初亦不之信也，细抽绎之，真与耕通用为多，文与元合用较广，此真、文之界限也。”其中与“真”通用为多的“耕”部就是段氏第十一部，而与“元”合用为广的“文”部则是段氏第十四部，江有诰之说也可以从图16中得到验证。第七部和第八部则是十七部中唯二的收唇韵，以-m、-p为韵尾。十七部中与其他韵部发生合韵最多的是第十五部（每行合韵次数相加），这与段氏对第十五部的界分过宽有关。在段氏的异平同入框架下，第十五部是包含入声韵的，且第十三部和第十四部皆以十五部之入为入。

图16 《段注》谐声偏旁分部互用的合韵热力矩阵

前人关于段玉裁谐声归部的研究主要集中于三个方面：一，《段注》字下所注韵部与《六书音韵表》的差异比较，如周祖谟、何九盈等；二，段氏对谐声字古音归部的处理原则与方法，如刘忠华；三，“谐声偏旁分别部居”所体现的韵部间的远近关系，如郭必之。较全面的研究有张道俊的《<说文解字注>古韵订补》与《<说文解字注>古韵归部》，前者对《段注》中韵部未标、错标以及游移字的情况进行了订补校勘；后者通过对比，探讨了段氏古音归部的依据以及归部标准的变化。

《段注》知识库的构建，不仅可以贮存前人的研究成果，还可以推动相关研究路径的三重突破：其一，支持对段氏谐声归部体系的系统性检验，通过批量比对与统计分析，揭示其内在逻辑与潜在矛盾；其二，借助关联查询与可视化技术，为其理论与方法提供直观的数据支撑；其三，知识库的开放性与可扩展性，有利于后续整合构形、音韵、训诂等多维数据，拓展研究的深度与广度。

结语

段玉裁《说文解字注》知识库以段玉裁《说文解字注》与《说文解字读》为底本，系统梳理其内在的结构，构建了涵盖文字、音韵、训诂、校勘和引书五大范畴的三层分类知识体系，并完成了五十六个知识点、数万条实例的结构化提取与存储。在此基础上开发的交互式应用平台，不仅支持原文检索与知识导览功能，更提供了开放式标注与管理功能，支持知识库的可持续建设。

展望未来，研究仍有多方面的深化空间。知识层面，可进一步扩充专题知识集，构建更为丰富、立体的学术语境；技术层面，可探索引入自然语言处理技术，辅助知识点实例的半自动标注与关系抽取，提升知识库构建的智能化水平；应用层面，可开发面向不同用户群体的接口,如为专业学者提供网络分析等深度研究工具，推动数字化研究范式的转型。此外，知识库的可持续发展机制也需进一步探索，如建立协同标注模式，吸引领域专家通过在线平台参与审核、补充与讨论、保障知识库的活力与权威性。我们期望《段注》知识库能够逐步成长为一个动态演进、持续完善的数字学术基础设施，为中国传统语言文字学传承、发展与创新提供扎实资源支撑与方法赋能。

排版：黄琬喻

初审：徐碧姗

复审：段婧怡

终审：夏翠娟

阅读原文

跳转微信打开

美 - 加州大学洛杉矶分校（UCLA）

Get Help Making Your PDF’s Accessible with Humtech

作者Jordan Galczynski

2026年4月17日 04:59

image of the moon in three phases with superimposed text

In an effort to meet web accessibility standards, HumTech has developed a new AI-powered tool currently called “Repair PDF,” where previously scanned PDFs are rebuilt by adding a searchable text...

美 - 弗吉尼亚大学（Scholars'Lab UVA）

Prosopo-what-now?

作者adnan-zarif

2026年4月16日 12:00

I’m very belatedly joining the Praxis Fellows blog after more than a semester of meaning to write this post and not writing it. Let me start with the word that has been keeping me busy instead.

Prosopopoeia

Pro-so-po-PEE-a. It comes from the Greek prosopon (“face,” “mask”) and poiein (“to make”), and it means, roughly, to give a face to something that doesn’t have one—or more precisely, to give speech to something that cannot speak. (The Greek word for “make” is also where we get the word “poem”—literally “a made thing.”) When a poet makes a dead person talk, that’s prosopopoeia. When the poet makes the wind talk, or a river, or a nation— prosopopoeia. It’s one of the oldest tricks in the rhetorical toolkit, and one of the strangest: the fiction that things which have no voice—or rather, a voice which we are not accustomed to listening for—can be “given” one by the writer.

What I’d like to do in this post is walk through a few examples of prosopopoeia to give you a sense of why this figure fascinates me and why I think it matters so much.

A courtroom ghost

Let’s begin in a Roman courtroom. In 56 BCE, Cicero was defending a young man named Marcus Caelius Rufus against charges of political violence. Cicero’s strategy was to reframe the whole prosecution as the scheme of a scorned lover—Clodia Metelli, a powerful patrician widow who had been Caelius’s mistress. To discredit Clodia, Cicero did something audacious: he announced that he would summon from the dead her own ancestor, Appius Claudius Caecus—the famous blind censor who built the Via Appia, Rome’s first great road—and let him address her directly.

What follows is one of the most celebrated prosopopoeiae in classical rhetoric. Cicero assumes Appius’s voice directly and has him thunder at Clodia: Woman, what business have you with Caelius? What business with a young boy? The address is deliberately humiliating. The Latin word Cicero uses, “mulier” (“Woman!”), strips Clodia of her patrician dignity. The jurors know these are Cicero’s words, not Appius’s, but when placed in his mouth they take on, almost by alchemy, a new kind of power. Prosopopoeia, in this instance, is able to create a sense of moral authority, seemingly out of thin air.

A silent urn’s subversive speech

If the Cicero example shows prosopopoeia as a deliberate rhetorical strategy, Keats’s “Ode on a Grecian Urn” shows how the figure can be a force of destabilization—how the question of who is speaking can even become unanswerable.

You probably know the poem’s famous last lines: “Beauty is truth, truth beauty.” But here are the closing lines, in the version most modern anthologies print:

When old age shall this generation waste, Thou shalt remain, in midst of other woe Than ours, a friend to man, to whom thou say’st, “Beauty is truth, truth beauty,” —that is all Ye know on earth, and all ye need to know.

With the quotation marks around the phrase “Beauty is truth, truth beauty,” it seems quite clear that the urn is given a chance to speak and offer its profound metaphysical proposition on the nature of beauty and truth. The trouble is that those quotation marks do not appear in transcripts of the poem made by Keats’s friends, nor do they appear in other printed versions. This has led to major critical disagreement over how much speech can really be attributed to the urn. Are the lines “Beauty is truth, truth beauty,” said by the urn with the rest attributed to the lyric speaker of the poem or is the urn meant to be saying the entirety of the last two lines? Without any quotation marks indicating who is saying what, is the “ye” in the final line addressing the speaker of the poem, the readers of the poem, the urn, or the painted figures on the urn itself?

The instability of the speaker gives way to an instability of tone. Is the statement about beauty and truth a profound metaphysical proposition? Or is it a comic overstatement representing the naive and narrow point of view of the urn? The shifting quotation marks across manuscripts and editions are the material trace of a question prosopopoeia always raises: when we give speech to something that cannot speak, whose voice are we really hearing? And how exactly are we meant to hear them?

The sled dog, the bee, and the pig

This is where my dissertation comes in. The first chapter focuses on what I’m calling “animal persona poems”—contemporary poems written from the perspective of nonhuman animals. These poems occupy a strange position relative to prosopopoeia. Strictly speaking, they’re dramatic monologues: a poet assumes a character and speaks as that character, the way Browning speaks as the Duke of Ferrara in “My Last Duchess.” But the character is a sled dog, or a bee, or a pig. And the fact that the character is an animal changes everything about how the ventriloquism works.

Consider Timothy Donnelly’s “Malamute,” from his collection The Problem of the Many (2019). The poem is written in long, winding, syntactically complex sentences—each stanza essentially one sprawling clause accumulating subordinate phrases. Every verse paragraph opens with the same refrain: “When I was a dog I pulled the sled with the other dogs.” That past tense is crucial. This isn’t a dog speaking in the moment; it’s a being that once inhabited a dog’s body and now reflects from some other vantage point. The dog pulled the sled with the other dogs “who didn’t know I didn’t know”—it conceals its disquiet, its not-belonging, while the elaborate syntax enacts an interiority far exceeding anything we’d attribute to a naturalistic animal consciousness. The poem doesn’t pretend to offer transparent access to a dog’s mind. It foregrounds its own artifice.

Or take “As Bee,” by Paula Bohince. Here the speaker is a bee—polite, reflective, stoically aware of its own brevity. “This is my first and only spring on earth. / I get it.” The bee wonders parenthetically about its own mother—”(did I have one?)”—which captures both biological reality (bees don’t have individual mothers in any meaningful sense) and deeper emotional uncertainty. What strikes me about this poem is the “As” in the title, which marks the poem as a performance, a speaking-as, not a speaking-from.

And there’s Margaret Atwood’s “Pig Song,” from You Are Happy (1974) which features a sequence called “Songs of the Transformed” in which nine speakers are animals who have been changed by “Madame.” The pig describes itself in grotesque imagery—”a greypink vegetable with slug / eyes, buttock / incarnate, spreading like a slow turnip”—and then turns on its captor: “Madame, this song offends you, these grunts / which you find oppressively sexual, mistaking / simple greed for lust.” The poem’s final lines crystallize something about prosopopoeia itself: “I am yours. If you feed me garbage, / I will sing a song of garbage. / This is a hymn.” The pig will mirror back exactly what it receives, and the mirroring becomes sacred utterance. The voice given to the degraded creature becomes the vehicle for its resistance.

The paradox

What connects all these examples—the Roman courtroom, the Grecian urn, the sled dog and the bee and the pig—is a shared paradox. Speaking for what cannot speak or what cannot be thought to speak is simultaneously an act of imaginative generosity and an act of appropriation. The voice that emerges always belongs to two speakers at once—the ostensible speaker (the ancestor, the urn, the animal) and the real one (the orator, the poet). And the uncertainty about whose voice we’re really hearing is prosopopoeia’s deepest feature.

For my dissertation, this paradox becomes especially charged when the subject is an animal. Animals do have voices—they bark and sing and grunt and click, not to mention the huge range of gestures and non-speech acts they perform. They communicate, even if we can’t fully understand what they’re saying. So the poet who writes in the voice of a dog or a bee isn’t conjuring speech from silence, the way Keats does with his urn. The poet is replacing one kind of utterance (the animal’s actual, opaque expressiveness) with another (human language). The animal already has a face. The poet is putting a new one on top of it. And the most interesting animal poems, I think, are the ones that know this—that build the awareness of their own ventriloquism into their formal structures.

More to come. My dissertation explores what happens to poetic language when it tries to cross the species boundary, and part of what excites me about being a Praxis Fellow is thinking about how digital methods might shape that inquiry—how you curate a corpus of poems, how you present and sequence them, and what it means to build a digital environment that lets readers encounter those poems rather than just read about them. In my next post, I’ll write about a workshop I recently ran in which participants assembled their own anthologies of Aesop’s fables and then designed pen-and-paper wireframes for how they’d present them as websites. Until then, thank you for reading.

11 - 日本数字人文协会（JADH）

DH2026に参加申込みをされるJADH会員の皆様へ

作者adminweb

2026年4月17日 08:59

DH2026に参加申込みをされるJADH会員の皆様におかれましては、参加申込みの途中で、所属組織のコードを求められることがあると思います。
その際には、secretary_at_jadh.org (_at_を@に置き換えてください) にメールをお送りください。JADH事務局の方でコード発給に関する手続きを行ないます。

よろしくお願いいたします。

12 - 艺术与人文数字研究基础设施（DARIAH）

Transformations: Call for proposals for Special Issues

作者Eliza Papaki

2026年4月16日 15:19

Transformations: A DARIAH Journal invites proposals for a special issue. Alongside our annual regular thematic issue, Transformations: A DARIAH Journal publishes special issues that open up additional conversations in the arts and humanities (see our online Procedures for Special Issues).

We hereby invite proposals that:

make a clear scholarly case for a focused theme
build an engaged community of authors and readers
can be delivered within realistic editorial and review capacity

Special issues follow the same editorial standards and external peer review as regular articles. Guest editors may submit articles, but such submissions will be handled independently and may not exceed 25% of the final special issue.

All special issues:

are overseen by the Editor-in-Chief and Editorial Board
receive the same peer review and editorial scrutiny as regular articles
are published with clear labelling and transparent editorial roles

Multilingual special issues

Transformations actively welcomes proposals for multilingual special issues. They are subject to the same editorial oversight, peer-review standards, and publication ethics as all other contributions to Transformations.

Special issues may be:

fully in English
fully in a language other than English

Proposals for multilingual issues should:

explain the scholarly rationale for the chosen language(s)
identify the intended scholarly community
demonstrate the availability of qualified reviewers in the relevant language(s)

Who can apply

Proposals are welcome from individuals or teams, from any institution or country, including interdisciplinary and cross-sector editorial teams.

How to apply

Please fill-in this template form and submit it (in PDF) to transformations@episciences.org with the subject line: Special Issue Proposal.

Timeline

Call opens: April 2026
Deadline for special issue applications: 30 June 2026
Information about the decision: 15 July 2026
Call for papers / invitations / preparation of contributions: 15 July 2026 – May 2027
Peer review: June – Sept 2027
Copy editing & Formatting: Oct – Nov 2027
Target publication: December 2027

4 - 德语区数字人文研究协会博客（DHd-Blog）

Virtuelles DH-Kolloquium an der BBAW, 27.04.2026: Historische Fremdsprachenlehrwerke digital. Erschließung historischer, mehrsprachiger Quellen: Kontext und Herausforderungen

作者Johannes Ioannu

2026年4月17日 21:16

Im Rahmen des DH-Kolloquiums an der BBAW laden wir Sie herzlich zum nächsten Termin am Montag, den 27. April 2026, 16 Uhr c.t., ein (virtueller Raum: https://meet.academiccloud.de/gl/rooms/lou-eyn-nm6-t6b/join)

Janine Rauchmann (TU Darmstadt), Dr. Kerstin Roth (Universität Hamburg),
Lisa Scharrer (TU Darmstadt), Dr. Josephine Ulbricht (BBAW)

Das Akademienvorhaben „Historische Fremdsprachenlehrwerke digital“ (FSL digital) widmet sich erstmalig der Volltexterschließung, korpuslinguistischen Aufbereitung, Annotation, digitalen Vernetzung sowie der sprach-, kultur- und wissenshistorischen Auswertung von mehrsprachigen Fremdsprachenlehrwerken aus der Frühen Neuzeit (15. – 17. Jahrhundert). Fremdsprachenlehrwerke der Frühen Neuzeit, die bis zu zehn Vernakularsprachen parallel abbilden, stellen eine genuin mehrsprachige Textsorte mit komplexem Layout dar. Somit bringen diese Quellen besondere Herausforderungen bei ihrer digitalen Aufbereitung mit sich. Um der inhaltlichen und strukturellen Heterogenität der Lehrwerke gerecht zu werden, bedarf es eines geeigneten Ansatzes zur (automatisierten) Texterkennung, der auch zukünftige Aspekte der Text- und Layoutannotation frühzeitig mitdenkt. In diesem Zusammenhang möchten wir die Quellengruppe und den aktuellen Workflow im Projekt vorstellen, welcher angesichts der großen Datenmenge einen Work-in-Progress darstellt und neben Text- und Layouterkennung auch das Postprocessing des OCR-Outputs sowie erste Ansätze zur Textannotation umfasst.

***

Die Veranstaltung findet virtuell statt; eine Anmeldung ist nicht notwendig. Zum Termin ist der virtuelle Konferenzrraum über den Link https://meet.academiccloud.de/gl/rooms/lou-eyn-nm6-t6b/join erreichbar. Wir möchten Sie bitten, bei Eintritt in den Raum Mikrofon und Kamera zu deaktivieren. Nach Beginn der Diskussion können Wortmeldungen durch das Aktivieren der Kamera signalisiert werden.

Der Fokus der Veranstaltung liegt sowohl auf praxisnahen Themen und konkreten Anwendungsbeispielen als auch auf der kritischen Reflexion digitaler geisteswissenschaftlicher Forschung. Weitere Informationen finden Sie auf der Website der BBAW.

4 - 德语区数字人文研究协会博客（DHd-Blog）

Stellenausschreibung Wissenschaftlichr Mitarbeiterin im Bereich Digitale Musikwissenschaft

作者Anna Neovesky

2026年4月17日 18:38

Wissenschaftliche*r Mitarbeiter*in im Forschungsprojekt „MusicDNA“

E 13 TV-L | Umfang: 75%-Teilzeit / 30 Stunden | Beginn: ab 1.6.2026 | Befristet bis: 31.7.2027

Bewerbungen bis zum 22.04.2026 über das Online-Portal der Fachhochschule Erfurt: https://jobs.fh-erfurt.de/otdsm

Das Projekt MusicDNA
Das Projekt MusicDNA zielt darauf ab, den Medienbruch zwischen Noten in Partituren und ihrer akustischen Manifestation zu überbrücken. Es geht darum, Audioaufnahmen und notierte Musik zugleich zu durchsuchen. Eine wichtige konzeptionelle Rolle spielen dabei Incipits und ihr Notationsformat „Plaine & Easie Code“. Ausgangspunkte sind katalogisierte Quellen und Partituren sowie der Web-Service IncipitSearch, der Inhalte aus Musikkatalogen aggregiert und durchsuchbar macht. Das Projekt MusicDNA wird durch die Carl-Zeiss-Stiftung gefördert und findet in Kooperation mit dem Fraunhofer-Institut für Digitale Medientechnologie IDMT statt.

Deine Aufgabe: Du entwickelst im Projekt einen Prototyp, mit dem Musikkataloge mittels Summen einer Melodie (Query-by-Humming) durchsucht und Audioaufnahmen in notierte musikalische Incipits umgewandelt werden können.

Deine Mission

Du konzipierst und entwickelst eine webbasierte Software-Lösung mit Query-by-Humming-Funktionalität.
Du entwickelst ein Verfahren zur Umwandlung von Noten aus Audio zu Plaine & Easie.
Du ergreifst Maßnahmen zur Sicherung der Daten- und Software-Qualität.
Du vernetzt dich mit den Akteur:innen in der Fachhochschule, den Kooperationspartnern und der Fachcommunity.

Das bringst du mit

ein abgeschlossenes Masterstudium im Bereich Angewandte Informatik, Digital Humanities mit dem Schwerpunkt Digitale Musikwissenschaft oder Musikwissenschaft mit nachgewiesener Erfahrung in der Software-Entwicklung
vertiefte Kenntnisse im Bereich Webentwicklung und Webtechnologien, insbesondere Javascript oder Python
Kenntnisse im Bereich Musik und Musikwissenschaft
Deutschkenntnisse in Wort und Schrift (mindestens auf dem GER-Sprachniveau C1) und Englischkenntnisse mindestens auf dem GER-Sprachniveau B1 oder Englischkenntnisse in Wort und Schrift (mindestens auf dem GER-Sprachniveau C1) und Deutschkenntnisse mindestens auf dem GER-Sprachniveau B1
Erfahrung in der Umsetzung von Software- oder Webprojekten
hohe Eigenmotivation zur kreativen und eigenständigen Aufgabenerledigung sowie ausgeprägte Kommunikationskompetenz

Das bieten wir dir

Vergütung nach TV-L (E 13), betriebliche Altersvorsorge und Sonderzahlungen
flexible Arbeitszeiten, die zu Deinem Leben passen, und die Möglichkeit, mobil zu arbeiten
eine moderne Campus-Hochschule mit einem engagierten und diversen Team
persönliche Weiterentwicklung durch interne und externe Fortbildungen sowie betriebliches Gesundheitsmanagement

Das Projekt ist an der hochschulübergreifenden Professur für Digital Humanities – Hybride Bildungs- und Kommunikationsräume angesiedelt (Fachhochschule Erfurt und Universität Erfurt).
Weitere Informationen zur Professur: https://www.uni-erfurt.de/philosophische-fakultaet/seminare-professuren/historisches-seminar/professuren/digital-humanities-hybride-bildungs-und-kommunikationsraeume

Deine Bewerbung

Bereit für den nächsten Karriereschritt? Dann bewirb Dich jetzt bis zum 22.04.2026 über das Online-Portal.
Stellenausschreibung: https://jobs.fh-erfurt.de/otdsm
Bewerbungsformular: https://jobs.fh-erfurt.de/15skq

Jetzt bewerben. Wir freuen uns auf Dich!

Vielfalt ist unsere Stärke. Bewerber:innen mit Schwerbehinderung werden bei gleicher Eignung bevorzugt berücksichtigt. Wir begrüßen besonders Bewerbungen von Frauen und auch Menschen mit internationalem Abschluss. Von letzteren ist eine Zeugnisbewertung der Zentralstelle für ausländisches Bildungswesen (www.kmk.org/zab) erforderlich.

Die Auswahlgespräche sind für Mitte Mai 2026 geplant.

4 - 德语区数字人文研究协会博客（DHd-Blog）

GND-Forum Humanities@NFDI: 23./24.06.26 in Göttingen

作者Stefan Buddenbohm

2026年4月17日 16:03

Mitwirkende der vier geisteswissenschaftlichen Konsortien der Nationalen Forschungsdateninfrastruktur (NFDI) treffen sich zum GND-Forum Humanities@NFDI am 23./24. Juni in Göttingen. Zentrales Thema des Arbeitstreffens ist die Gemeinsame Normdatei (GND) aus unterschiedlichen Blickwinkeln.

Die Veranstalter:innen

Wir organisieren das GND-Forum Humanities@NFDI als Team: Barbara Fischer (Deutsche Nationalbibliothek, GND-Community), Stefan Buddenbohm & Alex Steckel (SUB Göttingen als Gastgeberin sowie Text+). Melanie Gruss & Desiree Mayer (NFDI4Culture), Anja Gerber & Frank von Hagel (NFDI4Objects), Katrin Moeller und Anne Purschwitz (NFDI4Memory), Domenic Schäfer & Michael Markert (VZG des GBV).

Die GND-Foren

Mit dem Veranstaltungsformat FORUM verfolgen wir, d.h. die GND-Community, das Ziel, mit euch gemeinsam – den Nutzenden, den Forschenden, den Infrastrukturbetreibenden – über konkrete Anwendungsszenarien der GND nachzudenken.

GND-Forum Humanities@NFDI

Das Selbstverständnis der GND ist es Brücken zu schlagen zwischen unterschiedlichen Ressourcen und Communities. Wie kann ihre Doppelrolle als Datenbank und Organisation beim Ausbau der Zusammenarbeit der vier geisteswissenschaftlichen Konsortien NFDI4Culture, NFDI4Memory, NFDI4Objects und Text+ helfen, ist die zentrale Frage in Göttingen. Hintergrund der Frage sind zum einen die für 2028 erwartete Neuausrichtung des NFDI-Förderprogramms, und zum anderen wie das Synergienpotenzial der bisherigen Entwicklungen in den jeweiligen Konsortien rund um die GND für die anderen Partner genutzt werden kann. Beides soll aus drei Blickwinkeln diskutiert werden:

Die Rolle der GND in Tools und Services der Konsortien
Unterschiedliche Perspektiven auf vier Entitätstypen der GND
Zukunftspotenzial der vertieften Zusammenarbeit der Konsortien mittels der GND

Themen

Wir wollen Euch die Möglichkeit geben, euch aktiv einzubringen und schlagen den folgenden programmatischen Dreiklang vor:

Tools: Welche Werkzeuge für welche Zwecke?
Die GND im Schulterblick: Tandems aus GND- und NFDI-Kolleg:innen stellen die Entitätentypen der GND im Anwendungskontext vor
Strategie: Pfade für die GND in der NFDI

Anmeldung

Bitte meldet Euch sich bis zum 16.06. über die Anmeldefunktion dieser Veranstaltungsseite an. Wir haben 80 Plätze für Teilnehmende reserviert. Die Teilnahme am GND-Forum ist kostenfrei.