《数字人文》 (Journal of Digital Humanities)

Is There a Text in These Data? The Digital Humanities and Preserving the Evidence

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: The "digital humanities" umbrella shelters scholars curious about novel computer-mediated analysands (software, computer games, works of digital art and literature, social media, online-only forms such as the video supercut, and so forth) as well as scholars applying computational analysis methods to text, image, sound, and video corpora both small and unimaginably large. Nearly all of these scholars discover that fitting their work and its associated evidence into the humanities' present print-centered scholarly communication system carries serious challenges: is there a readable, reviewable, (print-)publishable, citable, immutable, preservable text in these data? Until the humanities consciously break the hegemony and path dependency of print, digital humanists will remain alienated from the rest of the humanities, preventing the humanities from adopting open processes such as data sharing and open-access publishing. In turn, this harms the reach and sustainability of the humanities as a whole.

Unsupervised Analysis of Domain-Specific Chinese Texts

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: With the growing availability of digitized text data both publicly and privately, there is a great need for effective computational tools to automatically extract information from texts. Because the Chinese language differs most significantly from alphabet-based languages in not specifying word boundaries, most existing Chinese text-mining methods require a prespecified vocabulary and/or a large relevant training corpus, which may not be available in some applications. We introduce an unsupervised method, top-down word discovery and segmentation (TopWORDS), for simultaneously discovering and segmenting words and phrases from large volumes of unstructured Chinese texts, and propose ways to order discovered words and conduct higher-level context analyses. TopWORDS is particularly useful for mining online and domain-specific texts where the underlying vocabulary is unknown or the texts of interest differ significantly from available training corpora. When outputs from TopWORDS are fed into context analysis tools such as topic modeling, word embedding, and association pattern finding, the results are as good as or better than those obtained from the outputs of a supervised segmentation method.

Text Mining and MARKUS: The Trade Networks of Chosŏn Interpreters in Qing China as a Case Study

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: This article presents a study on the trade networks reported in the Yŏnhaengnok (《燕行录》, Chosŏn envoys' travelogues) using the relational annotation function in the text analysis and reading platform MARKUS. The tributary relations (Ch: chaogong guanxi; Kr: chogong kwan'gye; 朝贡关系) established between the Qing and Chosŏn dynasties not only defined the nature of their political ties but also shaped their economic connections. Many studies have shown that participants in the Chosŏn tribute embassy undertook private trade and smuggling with Qing merchants. By mapping trade networks using the MARKUS and DocuSky platforms in combination, my research examines how Chosŏn interpreters, one of the most vital components of the Chosŏn tribute embassy, undertook non-official trade in Qing China. The analysis concentrates on issues such as with whom the Chosŏn merchants traded, what kinds of commodities they bought, and where they traded. This study makes methodological contributions to the literature by illustrating the utility of examining annotation-based networks built in the MARKUS and DocuSky environments, which involve entity extraction, creation of an ontology for network building, relational annotation, and visualization of networks and GIS data.

Humanities Computing as Interdiscipline

March 1, 2023, 00:00
Author:
Affiliation:
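Abstract: Introduction. Why raise this question? The agenda of this seminar poses a question: "Is humanities computing an academic discipline?" When asked this, people usually take a position while knowing little or nothing about its underlying assumptions. Because these assumptions remain largely unexamined, they tend to mislead our thinking about the work we do as computing humanists and weaken our ability to speak forcefully about the viability of humanities computing. I will work to break the question down and examine its core assumptions, in order to explore whether deeper questions can be asked. In pursuing those deeper questions, I will also mention some disciplinary areas that need to be brought within the scope of our research.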

Intelligent Recognition of Punctuation and Proper Names in Ancient Chinese Books

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: Sentence punctuation and entity recognition are important steps in the process of collating and publishing ancient Chinese books. In recent years, with the development of artificial intelligence technology, automatic punctuation has achieved considerable progress, and named entity recognition has also received more and more attention. Considering the knowledge dependence between the tasks, this paper proposes a joint learning method based on deep neural networks. First, we pre-train the language model on a large-scale ancient Chinese corpus to equip the model with grammatical and semantic knowledge of ancient Chinese. Second, we introduce a joint learning mechanism that enables the model to learn multiple tasks at the same time, and use a data augmentation strategy to alleviate the problem of insufficient training data. With only one model, our method can automatically label various types of tags such as punctuation, quotation marks, book names, place names, person names, and dynasties with high accuracy. On a multi-domain test set, our method reaches an F1 score higher than 94% on the automatic sentence segmentation task, 85% on the automatic punctuation task, and 87% on the named entity recognition task. The system based on our method can be publicly accessed at https://seg.shenshen.wiki/.

A Study on the Application of Dynamic Views in Management Systems for Character Compendia (文字编)

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: In the context of digitalization, the technique of using programs to compile text has been applied successfully, but data management still has shortcomings, such as delayed feedback on modification results and the lack of a data browsing platform. To solve these problems, this paper puts forward a new system design: by managing data through dynamic views, the system can support both reading and editing, and can combine various data processing modes so that they complement one another and meet needs at multiple levels.

Review Mining and Sentiment Analysis of the Overseas Translation and Reception of Jin Yong's Works

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: Overseas readers' reviews on online reading communities reflect their views of and emotions towards a book's translation. They are an important basis for judging the quality of the translation and the most direct reflection of its overseas communication effect, and are thus of great significance to the communication of Chinese literature and culture. This paper takes as its research objects the reviews of the four volumes of the English translation of Legends of the Condor Heroes (A Hero Born, A Bond Undone, A Snake Lies Waiting, and A Heart Divided), and uses text mining and sentiment analysis to investigate the overseas communication effect of this novel by Jin Yong, in order to provide a reference for the translation and communication of other literary works.

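Hot Topics and Public Reactions in the Governance of After-School Training Institutions: A Data Analysis of 10,124 Topics on the Zhihu Forum

March 1, 2023, 00:00
Author:
Affiliation: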
Abstract: With the promulgation and implementation of the "Double Reduction" policy, the public has expressed its opinions on open platforms, and the special governance of after-school training institutions has once again triggered heated discussion. Using Python, this study collected 10,124 texts about "governance of after-school training institutions" on the Zhihu platform and analyzed them with an LDA topic model and a sentiment analysis model. The results show that public discussion of governance focuses on student development, teacher treatment, social issues, education reform, education equity, and other aspects. The public generally supports the special governance of after-school training institutions, but a minority holds a negative attitude towards it, worrying that inadequate governance will exacerbate the education gap. In addition, educators are the most optimistic about and supportive of the governance, whereas students and other members of society are pessimistic about its actions and results.

Museums and Digital Culture: From Reality to Digitality in the Age of COVID-19

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: Museums increasingly recognize the need to address advances in digital culture which impact the expectations and needs of their audiences. Museum collections of real objects need to be presented both on their own premises and digitally online, especially as digital and social media become more and more influential in people's everyday lives. From interdisciplinary perspectives across digital culture, art, and technology, we investigate these challenges magnified by advances in digital and computational media and culture, looking particularly at recent and relevant reports on changes in the ways museums interact with the public. We focus on human digital behavior, experience, and interaction in museums in the context of art, artists, and human engagement with art, using the observational perspectives of the authors as a basis for discussion. Our research shows that the COVID-19 pandemic has accelerated many of the changes driving museum transformation, and this paper presents a landscape view of the characteristics and challenges of that transformation. Our evidence shows that museums will need to be more prepared than ever to adapt to unabated technological advances set in the midst of cultural and social revolution, now intrinsic to the global digital ecosystem in which museums are inevitably connected, participating, and entrenched. This underscores the central importance of an inclusive, integrative museum model between physical and digital reality.

Intelligent Pattern Generation Based on the Decomposition of Chu State Patterns

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: The patterns of the state of Chu are an important part of Chinese art history. However, past research on these patterns mainly focused on analyzing and interpreting their meanings, and lacked a general understanding of the rules that govern them. By analyzing the constituent units of Chu patterns, this paper constructs a functional structure code for them and builds a database containing decomposition information for 270 Chu patterns. On this basis, the paper uses artificial intelligence to automatically combine given pattern units into patterns with aesthetic value, realizing an innovative application of Chu patterns. The ideas and methods of this study are instructive for the innovative inheritance of Chu patterns and of traditional culture more generally.

Chinese Cases in Foreign Digital Humanities Textbooks and Their Construction of China's Image

March 1, 2023, 00:00
Author:
Affiliation:
Abstract: Foreign digital humanities textbooks contain many digital humanities projects and practical application cases from China, which play a role in building the image of China. Taking Johanna Drucker's The Digital Humanities Coursebook: An Introduction to Digital Methods for Research and Scholarship as the research object, we focus on several Chinese cases, including Chinese historical photos, the space-time infrastructure of Chinese civilization, Chinese poetry exchange, the three-dimensional Dunhuang Mogao Grottoes, and an ancient Chinese programming language. We find that most of these cases come from foreign projects, and original Chinese cases have not attracted enough attention; that the cases involve and demonstrate the multi-dimensional history, geography, and culture of China, and construct a three-dimensional national image; and that the cases are distributed across digital humanities projects with diverse technologies and methods, with wide application and great communicative influence.

Written Conversation Series No. 7: Building Databases of Ancient Book Catalogs

March 1, 2023, 00:00
Author:
Affiliation:
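Abstract: Sun Xianbin (Institute for the History of Natural Sciences, Chinese Academy of Sciences): Building and Applying Databases of Ancient Book Catalogs. Ancient book databases have two main kinds of application: academic research and cultural outreach. For catalog databases of ancient books, the main application at present is still academic. That being so, we should go back to the original purpose of classical bibliography, that is, how it supports academic research. Once this is clear, it sets the basic tasks for building catalog databases of ancient books. If digital humanities methods cannot even support the research that traditional bibliography already supports, the result is clearly unsatisfactory.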

Editors' Afterword

March 1, 2023, 00:00
Author:
Affiliation:
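Abstract: Information digitization projects and the building of document databases have always been the top priority in digital humanities infrastructure. For some "outsiders", they are almost equivalent to digital humanities itself. This journal has always valued this work, while also holding that people often see only the results of digitization, whereas its social, cultural, political, and economic dimensions are obscured. As academic research relies more and more on digital resources, we need to consider how digitization practices have shaped the foundations of knowledge construction, so that we understand them critically rather than accept them unconditionally as an established body of knowledge. Since its founding, this journal has consciously promoted critical research on digital humanities infrastructure, which is reflected in some sections of this issue; we also plan to publish a special issue on these questions.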

Call for Papers

March 1, 2023, 00:00
Author: Editorial Office of the journal
Affiliation:
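Abstract: Guided by the idea of promoting academic innovation through interdisciplinary exchange, Tsinghua University and Zhonghua Book Company jointly publish the journal 《数字人文》 (Journal of Digital Humanities), which aims to provide a platform for theoretical discussion and focused studies in the rising field of digital humanities research. The journal accepts manuscripts in Chinese and English, including digital humanities research papers in the humanities, arts, and education, as well as related news from China and abroad (such as book announcements and reviews, conference and project reports, reviews of databases and other infrastructure, and digital humanities teaching cases).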

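Stems, Branches, and Auspiciousness: A Model of Zhou Day-Selection Methods as Seen in Bronze Inscriptions

February 1, 2023, 00:00
Author:
Affiliation: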
Abstract: Hemerology was very popular in early China. People used oracle bone divination to choose propitious days in the Shang Dynasty, and people of the Warring States period sometimes had a daybook. However, we still do not know how propitious days were chosen in the Western Zhou Dynasty. We hypothesize that the cyclic Stems and Branches are related to auspiciousness. Through an exhaustive statistical analysis of the inscriptions with propitious marks from the Western Zhou Dynasty (including the Spring and Autumn Period), we construct a model of hemerology based on the rules of the combination of Stems and Branches, and test it with the dates of the ceming (king's appointment) rituals; the results are roughly consistent with the model. This Western Zhou hemerology model differs from the hemerology system in the Book of Rites and other documents in terms of chronology and principles.
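As background to the abstract above: the "Stems and Branches" are the ten Heavenly Stems and twelve Earthly Branches, which pair off into a cycle of 60 day names. The sketch below only enumerates that cycle and tallies how often each day name occurs in a list of dates. The function names and the sample list are hypothetical illustrations, not the authors' data or their model.

```python
from collections import Counter

STEMS = "甲乙丙丁戊己庚辛壬癸"          # ten Heavenly Stems
BRANCHES = "子丑寅卯辰巳午未申酉戌亥"    # twelve Earthly Branches


def sexagenary_cycle():
    """Return the 60 stem-branch day names in cycle order."""
    # Stem and branch advance together, so only 60 of the 120 pairs occur.
    return [STEMS[i % 10] + BRANCHES[i % 12] for i in range(60)]


def tally_day_names(day_names):
    """Count occurrences of each day name, rejecting invalid pairs."""
    valid = set(sexagenary_cycle())
    for name in day_names:
        if name not in valid:
            raise ValueError(f"not a valid stem-branch pair: {name}")
    return Counter(day_names)


if __name__ == "__main__":
    cycle = sexagenary_cycle()
    print(len(cycle), cycle[0], cycle[-1])
    # Hypothetical sample of inscription dates, for illustration only.
    sample = ["丁亥", "庚寅", "丁亥", "甲子"]
    print(tally_day_names(sample).most_common(1))
```

A tally like this would only be a first step; testing whether some day names are over-represented would need the actual corpus of dated inscriptions.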

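Analyzing the Evolution of Commercial Space in Lanxi County Town during the Republican Period with Spatial Humanities Methods

February 1, 2023, 00:00
Author:
Affiliation: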
Abstract: With the digitization and spatialization of humanities studies, the development of digital humanities and spatial humanities has provided new perspectives and methods for humanities research. Spatial humanities methods fit well with urban spatial research, and their combination offers broad possibilities. This study explored the possibilities of applying spatial humanities methods to urban spatial research at the county scale. It took Lanxi County in Zhejiang Province during the Republic of China as its object and used data including historical maps and statistical reports; the spatial analysis tools used include Kernel Density Analysis, Colocation Analysis, and Network Analysis. The research retrospectively analyzed the pattern of the street network in Lanxi County and, on that basis, thematically analyzed the spatial transition in the distribution of four commercial industries in Lanxi County: transportation, grain, traditional Chinese medicine, and teahouses. The results of spatial analysis and digital humanities mapping revealed the different trends in the distribution of these industries along the Lanjiang River, and reflected the impact of water transport outside the south gate and of the emerging railway transportation on the commercial spatial distribution.

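Translation Assessment and Machine Translation in the Digital Humanities Era: A CiteSpace-Based Analysis of Current Status and Trends (1992-2022)

February 1, 2023, 00:00
Author:
Affiliation: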
Abstract: Digital humanities is an interdisciplinary research field that connects computer technology and social sciences. In the era of digital humanities, the study of translation assessment may advance with the development of information technology, as exemplified by machine translation. With CiteSpace as the bibliometric tool, this paper analyzes and discusses the status quo of translation assessment research in China. It finds that machine translation is now one of the core topics within the framework of language testing, but the focus remains the human evaluation of machine translation rather than the direct application of machine translation to translation assessment. The integration of translation assessment and machine translation is expected to boost the development of both in the future.

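A Digital Presentation of the Stylistic Coloring of Middle Chinese Translated Buddhist Scriptures

February 1, 2023, 00:00
Author:
Affiliation: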
Abstract: Using the methods of digital humanities, this paper quantifies the degree of colloquialism of the Middle Chinese translated Buddhist scriptures, based on the frequencies of the written and spoken forms of featured words, and then renders their linguistic styles with radar charts. These results reveal that most of the Chinese Buddhist scriptures are more colloquial than native Chinese texts, but that the degree of colloquialism varies among the scriptures. Moreover, the evolution of colloquialism is not consistent across different word classes and different featured words. The methods of digital humanities thus help to clarify the special nature of Buddhist scriptures and promote the full exploration of their linguistic value.
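The abstract does not state the formula used, so the following is only a minimal sketch under a stated assumption: for each featured word, the degree of colloquialism is taken to be the share of spoken-form occurrences among all occurrences, and the per-class mean is the value that would be plotted on one axis of a radar chart. The function names, word classes, and counts are hypothetical.

```python
def colloquialism_degree(spoken_count, written_count):
    """Share of spoken-form occurrences (assumed definition), or None if unseen."""
    total = spoken_count + written_count
    if total == 0:
        return None
    return spoken_count / total


def degrees_by_class(counts):
    """Map each word class to the mean degree over its featured words.

    counts: {word_class: [(spoken_count, written_count), ...]}
    """
    result = {}
    for word_class, pairs in counts.items():
        values = [colloquialism_degree(s, w) for s, w in pairs]
        values = [v for v in values if v is not None]
        result[word_class] = sum(values) / len(values) if values else None
    return result


if __name__ == "__main__":
    # Hypothetical counts for one text, for illustration only.
    sample = {"pronoun": [(30, 10), (5, 15)], "adverb": [(8, 2)]}
    for word_class, degree in degrees_by_class(sample).items():
        print(word_class, degree)
```

Averaging per featured word rather than pooling all counts keeps one very frequent word from dominating a class; whether the paper does the same is not stated in the abstract.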

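Attitudes toward English in Chinese Society from a Language Planning Perspective: A Digital Humanities Text Analysis of Weibo Comments

February 1, 2023, 00:00
Author:
Affiliation: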
Abstract: It is imperative to investigate the social-psychological factors in China's foreign language education planning. Language attitude, as the core of these factors, needs more attention. Adopting a digital humanities approach, this study collects Weibo comments on the topic "CPPCC member proposed abolishing English as a major subject in basic education" and conducts sentiment analysis, keyword visualization, and LDA topic modeling using R and Python to reveal people's attitudes toward English and to further probe the social-psychological factors in China's English education. The results show that objections to the proposal dominate the comments. The attitudes toward English reflected in this event can be described within the theoretical framework of the Affective-Behavioral-Cognitive model of language attitude. Cognitively, the importance of English has been widely acknowledged and an awareness of language equality is emerging among the public. At a higher level, the contributions of compulsory foreign language education to social justice and national strategies have also been recognized. Affectively, there is evident public anxiety about developing speaking and listening skills. Behaviorally, the demand for English language education for diversified purposes has increased. This study holds that research on attitudes toward English could provide bottom-up insights into China's foreign language education planning. Meanwhile, the digital humanities method also provides support for the social approach to language attitude research.

On the Significance of Building the "Chinese Music Grand Ceremony Database" (中国音乐大典数据库)

February 1, 2023, 00:00
Author:
Affiliation:
Abstract: The Chinese Music Grand Ceremony Database is a digital representation of the results of the Chinese Music Grand Ceremony, containing music literature, scores, and images from ancient times to the present, as well as audio and video resources of traditional music from fieldwork. Covering all aspects of Chinese music, professionally sourced, and rich in content, it is the most comprehensive database of Chinese music ever produced. It is another epoch-making initiative in the collation and study of traditional music, following the Chinese folk literature and arts integrated book, and is equivalent to the Four Books of Chinese Music. The database is a comprehensive, three-dimensional, and groundbreaking collection of literary, graphic, musical, and audiovisual resources that presents a comprehensive and sustainable picture of traditional music.