阅读视图
Hackathon: BiblioTech 2026
This event is only open to KU Leuven researchers, students and staff.
In March 2026, KU Leuven Libraries and the Faculty of Arts will organize the second edition of the BiblioTech Hackathon!
What is a hackathon? It is an event that is usually organized over a short period of time where participants come together in small groups and work intensively on a creative digital project or towards some digital end. In the case of BiblioTech, KU Leuven researchers, students, or staff will be divided into small groups and will work specifically on one of the datasets prepared (by LIBIS) for the hackathon. The groups will be guided by at least one group leader and will be able to rely on the help of an expert pool comprised of people who have specific technical knowledge and skills. The groups are free to follow their creative inspiration but must apply some digital approaches or tools to the dataset to produce an end result that will be presented in the form of a short presentation and a poster at the closing event of the hackathon.
Who are we looking for? One of the amazing benefits of hackathons is that they allow many different people with diverse backgrounds and skill sets to come together and to learn from one another. This is our goal for BiblioTech! We welcome applications from researchers at all stages of their careers, motivated students, and also KU Leuven staff members. Digital skills are not a must, but a willingness to learn about digital approaches definitely is. The hackathon should be a fun and engaging experience, and each participant should find themselves with new skills and perspectives at the end.
What about the data? The 2026 edition of the BiblioTech Hackathon is going places! Participants will have the option to work with two datasets both focused on the experience of travel. The first dataset comes from KU Leuven Libraries digitized collections and features historical picture postcards. The second dataset comprises historical travelogues. This combination of image, metadata, and textual materials provides many opportunities for the application of DH methods. We are all excited to see where this data leads you!
Practical details
The hackathon will span 10 days and will take place from Monday 16 March until Thursday 26 March. In addition to the working period of the hackathon, there will be a pre-hackathon brainstorming event where participants “Meet the Data, Meet the People,” prior to the start of the hackathon, a training day to learn how to use the infrastructure (ManGO and HPC service), and a closing event where the teams’ projects are presented.
- When: Mark your calendars for the following dates:
- Application Deadline: 5 January 2026 (23:59 CET)
- Pre-Hackathon Brainstorm | Meet the Data, Meet the People: 12 March 2026
- Infrastructure Training: 13 March 2026
- Hackathon Working Period: 16–26 March 2026
- Hackathon Closing Event: 26 March 2026
- from Monday 13 March until Thursday 23 March
- Where: Leuven (see above for more details)
- For whom: We welcome applications from researchers at all stages of their careers, motivated students, and also KU Leuven staff members. Digital skills are not a must, but a willingness to learn about digital approaches definitely is.
- Price: free
- Registration: Already convinced and want to take part? Great! Submit an application here. The deadline to apply is 5 January 2026 (23:59 CET). We look forward to hacking with you!
Want to see further details? Check out the BiblioTech Hackathon website for the most current information.

CAEDHET-Hackathon report (04/2025)
Event: AI4Culture Hackathon
On February 12 and from February 18 to 20, 2025, the AI4Culture Hackathon will see professionals, institutions and passionate individuals from the fields of AI, digital humanities, and cultural heritage teaming up to transform how shared cultural heritage is preserved, enriched, and interacted with.
The hackathon is designed to guide the participants to learn more about the AI tools available on the AI4Culture platform, trying them first-hand using unique cultural heritage datasets. Europeana, the treasure trove of European digital cultural heritage hosting over 50 million digital items, will be the primary repository for datasets. On the opening day, February 12, participants will be introduced to Europeana’s platform, explore curated collections, and get full access to their vast data via the Europeana API. This session will feature inspiring challenges designed to ignite innovative AI applications.
Participants will have the opportunity to explore five AI tools developed as part of the AI4Culture project. These tools include features such as AI-powered solutions for automatically generating multilingual subtitles for audiovisual content, applying OCR, transcription, and automated data enrichment techniques to enhance the quality and accessibility of digital collections, and detecting objects and other features in photos and videos.
Program
12 February 2025
10:00 – 18:00 CET | Irish College, Leuven
Hackathon Opening Day: Introduction to tools, data, and participants
13 – 17 February 2025
Online/Onsite
Independent team work
18 – 20 February 2025
Collaborative Work on Campus (Optional)
Collaborative space available at KU Leuven, Campus Arenberg
20 February 2025
15:00 – 19:00 CET | KU Leuven, Campus Arenberg, Heverlee
Hackathon Closing Day
Team presentations, awards ceremony and reception
Practicalities
Target audience: The initiative seeks individuals with diverse backgrounds and complementary skills, whether technical, creative, or interpersonal. While prior hackathon experience is advantageous, motivation is the primary consideration.
Date: 12 February – 20 February
Location: KU Leuven campus, Leuven (various locations)
Application deadline: 10 February
To see full details about the AI4Culture Hackathon, including the required application procedure, please visit the event webpage.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Lode Moens, BiblioTech Hackathon participant. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Lode’s group, called StudentsBlock, worked on the Magister Dixit dataset, which features a collection of handwritten lecture notes of the old University of Louvain (1425-1797). You can learn more about the team’s work by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

Members of the StudentsBlock team pose for a photo at the reception of the closing event.
What first interested you in the hackathon? Have you done one before? What is your background?
I’d never done a hackathon before, since this was the first one directly associated with the Faculty of Arts [I decided to participate]. I was interested in it because my friend was going to apply, and she asked if I was interested. I’m not a huge ICT nerd but wanted to improve my skills, so it sounded like a good opportunity. So I looked at the datasets, and it sounded interesting. [I know] Professor Fantoli; I do some student work for her and thought why not. I have no experience programming, but I’m always eager to learn new stuff. I study Greek and Latin and a bit of Theology as part of my master’s.
What was your primary concern when beginning the project? The project? The process?
My first concern was whether we were going to find a subject. We had a lot of ideas but not a lot of it was super straightforward. So it took a while to establish a clear idea. Even then, it was a bit chaotic, but it was nice to see people taking the lead and our team leader did a good job at coordinating everything.
What kind of audience did you have in mind?
Our project was to optimize an existing database of student notes. Our first target audience was researchers – students, professors, PhD students, etc. of the Classics, history, philosophy, theology, law, basically anything that the notes represent, as well as anyone doing research on the Old University of Leuven for a Bachelor’s or Master’s thesis. Secondly, just people interested in the Old University and the courses. Yannick and Linde [team members] were also both familiar with notes from the University and are trying to reconstruct the old student and professor notes. So that was the less abstract audience. Very topical and applicable. The jury asked us what we would you do if we couldn’t read Latin, but transcribing all this material would take some time.
How did you establish your methodology and approach to the data set? Were you inspired by any other platforms or projects?
We had a PhD student on our team who comes from the Faculty of Economics and is now working in DH. She absolutely loves data and was able to figure out what we should do. We had three people working on the XML file. They cleaned the data and structured it. We had a specific goal in mind; we wanted to structure the data according to students, dates, etc., but we had some unorganized data. We also had two people working on the presentation. One was writing the HTML for the Toledo page [KU Leuven’s educational platform], and I myself took care of communication. So, the project was quite structured and everyone knew what they had to do. People used their backgrounds to work together. We had someone doing DH, Greek, and our team leader, Daria, is from LECTIO. So we knew what was possible to accomplish. We had a lot of meetings – two each day – so communication was efficient.
When did you reach out to the experts?
We reached out to ask if we could use the CSS profile of Toledo, as well as the Python script. So not in the very beginning, but during the first step.
What was the brainstorming process like at the “Meet the Data, Meet the People” event before the hackathon officially kicked off? Was there a clear vision from the beginning?
There was not a clear vision. Daria’s first idea was to work on doodles – so what students wrote in the margins of their notes. But the manuscripts we had were neat and more polished than expected. So if they talked about how to construct a globe, there would be technical drawings but no funny doodles. We were going to reconstruct the lecture cycles, but that was a bit too narrow. But it was great to meet the people before the stress of the actual hackathon. Once the hackathon started, it was chaotic. So having [established] personal connections first, then the technical skills, was a nice approach. One or two people also came [to the “Meet the Data, Meet the People” event] and had to withdraw – so it was good for them to get that first taste, and we didn’t immediately start working with them.
Were there any issues you ran into when cleaning up the data or creating the database?
There were some problems with the cleaning process because of the dates; not all manuscripts had dates in the metadata. So then we had to try to figure out what should be included in our data but we weren’t sure how to structure it. We had some conflicting ideas on how to target this. There were some Python problems which had to be recoded too. I also reached out to an old professor of Daria’s to take a second look at the Latin and help correct it.
How was the idea for the Toledo-inspired website first conceptualized and then implemented?
At the opening event we were throwing ideas out and someone said “oh we could visualize it as a Toledo page.” The idea stuck, but there was a practical problem: to be able to make it a decent page, we needed one student and three courses for the tiles [on the homepage of the platform]. That was something we managed to do with our dataset. It was kind of all or nothing. Once we saw it was possible, the writing started: the CSS profile, the underlying HTML. One person had a clear idea of what it could look like. Then it was a matter of filling it with content descriptions, side notes, history. We actually had someone who did a Master’s in History and so we were able to use some of his thesis topic and really had some fun with it. We had a structured dataset so we wanted to present it in a fun way. Toledo was a natural decision since it was familiar to so many students. All in all, it was a result of hard work.
How would you describe your experience?
It was very nice to work with people from so many backgrounds; math, history, philosophy, Dutch; you came into contact with people you wouldn’t normally meet. There were challenges in ways of working, ideas, etc., but in the end we worked well. We also all had a drink after so we all got along well [laughs] and we’re all pretty friendly now. It was interesting to be a part of the hackathon without a strong ICT background. It was really interesting to see what can be possible, especially when watching the other teams’ presentations. Our own dataset didn’t lean towards visualization like some of the other teams, but we still were able to learn a lot about the university, which brought the data to life.
How could you use these skills for future research?
I do textual research, so I look at lots of manuscripts. At this point, a model can’t really be trained for OCR because there are too few manuscripts and handwriting, regional differences, etc. make it difficult to analyze in that way. Medieval Latin is also not easily analyzed because the syntax is different and sometimes one manuscript has more authority than another. Sometimes it can also be wrong. The difficulty now is to interpret the Latin and there is difficulty in interpreting the data. As a classicist, the language should always be the main objective. DH can be a tool or asset, but it is not the main objective. I’ve been testing out different DH approaches, and, while I still like to work with it manually, there are always chances that my research will be become more digital.
What kind of tips would you give to a team doing their first hackathon? Any tips for someone from your background?
Try to apply your strengths – know your strengths and weaknesses. Try to look at the possibilities and what would be interesting to you on the subject of DH. I’m hoping to learn Python but also Old Norse and Arabic. It can be very stimulating to think about how to solve problems in a new way, and it might even be a bridge to build an interest in DH. And, don’t panic… there will be times when there is a lot of chaos but just look at what you have and where you want to go and just keep moving.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Ivania Donoso Guzmán, BiblioTech Hackathon participant. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Ivania’s group, called the Poststars, worked on the historical postcards dataset, featuring over 35,000 old Belgian postcards. You can learn more about the Poststars’ project by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

Ivania presents her team’s project during the closing event of the BiblioTech Hackathon.
What first interested you in the hackathon? Have you done one before? What is your background?
I had done one before, but it was the more traditional type where you work all day and all night. I didn’t like it [laughs]. So when I saw this opportunity to work with data, and it was over the course of a few days, it really interested me. I wasn’t sure whether or not to participate since I didn’t know who I would work with and what their experience was, but I had a friend who wanted to participate so we signed up. In the end he couldn’t make it, but I thought it was a good experience! Even though I was the one with more experience in programming, it didn’t feel like it was a bad thing. I really liked that there were people from different backgrounds. For the dataset, I actually love to collect postcards so this dataset interested me. I had also worked with digital humanities before because I participated in a project which tried to explain a political process that was happening in my country. I had worked with historians, sociologists, journalists, and poltical scientists and that experience was very interesting to me. So I wasn’t afraid to work with people from another background; it was actually an exciting opportunity to me.
I’m an engineer. I got my degree in Chile and France. I’m officially an IT engineer, but I’m not very interested in the industrial and high-level side. I was always more interested in computer science, user interfaces, AI, user relationships, etc., so that’s what my master’s focused on. I’ve worked for two years in computer science, and now I’m doing my PhD. I’m working ith Prof. Katrien Verbert in user and AI relationships, specifically explainable AI.
What was your primary concern when beginning the project?
My main concern was that there were limitations in the time frame. I think that as far as technical competence goes I didn’t know what to expect, but realized the other members’ skills were an asset. Additonally, there were lots of people supporting us throughout the process.
What kind of audience did you have in mind?
At first, we wanted to really understand the connection between the postcards and the places where the photos were taken. We wanted to look beyond the postcard itself. This would have been really interesting, but it didn’t work out. We ended up creating a search engine where you could write a sentence—for example, “children playing outside in the park”—and it would return postcards that matched. It was a really interesting approach for a search engine because usually search engines look at metadata, but with our project, the search engine analyzed the image itself. When we developed the project, we had in mind someone who would be interested in exploring the postcard set as a dataset for future research projects.
How did you establish your methodology and approach to the data set? Were you inspired by any other platforms or projects?
Our brainstorming process involved thinking about how we could contextualize the postcards: time period, location, etc. We also were interested in exploring the destinations the postcards were sent to. So we could collect and organize information about where the postcard was from and where it went. We also thought about the usage of the postcard: postcards used to be a way to communicate with people. It was cheaper than a telegram and would arrive before you returned home. But exploring these ideas wasn’t possible because the metadata didn’t contain that information, and it was really hard to find this information. We decided to work on the search engine. For this, we were inspired by current AI models and their usages. CLIP, a model created by open AI had good examples on how it worked, and it matched some of our ideas, so we decided to use it.
How did you create the search engine for the website?
Our team leader, Prof. Tim Van de Cruys, suggested we use the CLIP model. It brings images and texts into the same plane. So we first created a mathematical representation for the dataset, and then it would convert the search entry to a mathematical representation. From there, it would produce results. There were some cases when it didn’t match. For some sentences there weren’t any matches. But I think that for this dataset, which is very image-based, it was quite useful.
How did you use your professional/academic experiences to help with the hackathon?
I used some approaches I had learned from my experience as a software engineer and data scientist. We used Github to share and collaborate. Specifically the project tracking tools helped us stay up-to-date with what everyone was doing. I had experience putting systems in production (putting them online), so I was able to join what everyone had done into one system and make it available.
Did you apply any of the topics you are studying for your PhD to the project?
Not really [laughs]. But now I proposed three masters topics that would work with these datasets. I see so much potential for enriching the data. One idea is a date analyzer for the postcards. For this, we could try to use an AI model to predict the time frame of the postcard. In some cases, we have dates, but they’re not complete. Sometimes there is an x in the date, or it’s incomplete. So, we want to train a model that can predict the dates. The model could also predict aspects of the postcard, including color, image, shape, quality, and then the researcher can confirm manually if it is right. The other project ideas were more about how to transcribe what was written on the postcards. Understanding what is written on the postcards can be hard since they are handwritten and messy. Creating an interface to help researchers to add more information to the postcards would allow them to be analyzed better by researchers. [These projects] would relate to my PhD because they provide explanations of predictions made by an AI model. People have to make decisions, but the AI decisions have consequences for research.
Was there any main motivator/goal that encouraged the team when things didn’t go as expected?
Everyone had this level of compromise where we wanted to do something cool. Nobody was like “we have to work every day all day.” But there was also this idea that we will do what we can. It was very propositional, and we were able to adjust and move on quickly.
What were some of the roadblocks you faced?
There were some things we wanted to focus on, like the text on the postcards, that we couldn’t. There was also no information in the metadata about the stamps. So the metadata—or the lack of it—presented a problem.
What kind of tips would you give to a team doing their first hackathon?
Enjoy it! It’s an opportunity to learn, so don’t stress about it. Make sure to think about how to fix things if it doesn’t work; don’t feel defeated. Move forward. In a hackathon, if it doesn’t work, then you need to keep moving on so that you have a project. It’s better to try and to not give up immediately. Try two or three times, and if it doesn’t work, then move forward. The proposal of the hackathon was just to do something new since nobody has been working very closely with all of these datasets. So anything you do is contributing – anything you do is better than nothing.
Do you have any final comments?
I really enjoyed the hackathon. Some of my data visualization students were in my group, which was fun. I loved the technical support. We didn’t have to worry as much about the data because it was well documented. We didn’t have to worry about getting access to it or how to handle such big datasets because that infrastructure was already place. That made it very fast to experiment. I thought it was great!
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Daria Kondakova, BiblioTech Hackathon group leader. Daria is the Research Manager at LECTIO, the KU Leuven Institute for the Study of the Transmission of Texts, Ideas and Images in Antiquity, the Middle Ages, and the Renaissance. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Daria’s group, called StudentsBlock, worked on the Magister Dixit dataset, which features a collection of handwritten lecture notes from the Old University of Leuven (1425-1797). You can learn more about the team’s work by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

Some members of the StudentsBlock team pose together during the reception of the closing event.
What first interested you in the hackathon? Have you done one before? What is your background?
I’m a Classicist, and I’m currently working with Digital Humanities. I’ve never done a hackathon before but was always interested in how they work, so this seemed like an interesting opportunity. The Magister Dixit dataset was specifically why I chose to do it because there is a project on it at LECTIO. It was a nice way to get to know the dataset better, especially since Magister Dixit has a long project history, and I’ve only been with LECTIO for one year.
I had an interest in digital humanities before; I did programming whenever I could, then found DH in 2014/2015. I wouldn’t call myself a digital humanist, though, since I only use DH methods and my background is very traditional. My background is in Classics. My PhD is in Classics, and I mostly do things in Greek and Latin, specifically -300 BCE to 200 CE. I don’t really work on Early Modern stuff. So, in a way, the only link between the Magister Dixit collection and my research is the fact that it’s in Latin. But for the project this helped because I also knew about the history, layout, color fonts, and how to pull the data. There were things I would have liked to explore more. So this project was more connected to my technical interests. I’m very open minded when it comes to DH and don’t worry about the relevance of something and whether or not I will work on it again – it’s just fun.
Can you expand on a topic you wish you could have explored more? Do you mean the doodles in the margins of the notes?
Yeah, there weren’t many doodles and the student notes weren’t like what you expect now. It was more of a syllabus rather than actual notes. It wasn’t spontaneous production in the same way that your class notes are today. It was more formal. There were illustrations, but we also didn’t really know what to do with those. The data extraction approach presented us with a less shiny option, but it was more practical and usable. The design of the team’s project website was great, but the data was just a simple table.
What was your primary concern when beginning the project? Interface? Usability?
I’m accustomed to working with texts, so when beginning to work with Magister Dixit I wasn’t expecting it to be so metadata-heavy and was thinking we would work directly with the student notes. Once I realized we couldn’t approach the dataset in the way I was expecting, I was a bit concerned about how we could adapt our approach. I wasn’t that much concerned about the technical aspect as it seemed that after the first meeting we would be able to work together. My main concern was just finding out what to do with the metadata.
Before beginning the project, were there any other platforms or projects which you were planning to use as inspiration?
Not really. We decided we would do the metadata analysis and we had the help of Jarrik Van Der Biest, an expert on the dataset, who is working on the student notes for his PhD. A few other people also knew about student life and the content of the course, which helped us frame our question. So we started with reconstructing the lecture cycles. Ultimately, we did this by creating a Toledo-like website, mirroring the e-learning environment [Toledo] that current KU Leuven students use.
How did you apply your knowledge from working at Lectio to the hackathon? Were there any tools or methodologies that were translated to this dataset?
My job is to support and implement DH in the projects. As such, I have to have a good overview of what is out there and bring people and tools together. This is something that worked well, but I also learned a lot from the team members, too. Even as a team leader, I learned a lot; I didn’t want this to be a classroom exercise. We had someone who worked with Open Refine, which I had not used before, and it worked really well since Python can be hard to understand sometimes. My role was less about suggesting or incorporating tools and more about bringing people together. There was an intrinsic connection with what I do at LECTIO just by listening to the team members and seeing how all the tools worked together. In the last few days, I also joined in with the data analysis and with the help of two other team members, I was able to brush the dust off my coding skills. The main connection to LECTIO, I think, was the subject matter and just seeing, on a smaller scale, the challenges and advantages of having a variety of people with different backgrounds working on a single project.
Do you think participating in a hackathon as a student would have benefitted you in your career?
Definitely. I would have loved to be able to do that. I did a group project when I was doing a DH class during my master’s. There wasn’t enough commitment and things just didn’t work; we didn’t learn from each other. But with a hackathon, if you stick around, you learn a lot from your team and other teams. The final presentation, for example, I took a lot away from. There we had the opportunity to figure out what else we could have done and what would have worked. As a student, it’s great to have a result. There’s an output. You’re not doing it just for the sake of doing it, and you have the support of others as you’re working on it. So it would have been great to be able to do this as a student.
What kind of practical knowledge do you think can be gained from a hackathon like this?
Open Refine was a great tool that I learned about. I also learned a lot about MARC 21 XML tagging scheme from the Library of Congress since we had to go through the tags. I also got a better understanding of the material of the Magister Dixit collection without even going through the actual content. I knew metadata is a treasure trove of information but I only knew that in theory. So then seeing it up front was more practical. Project management was also a skill I worked on. Other projects I’ve done were more extended and involved a lot of reflection and checking things. Having 10 days was a challenge, especially when combining with my job at LECTIO, as I had to check in and stay continuously involved.
Was there any main motivator/goal that encouraged the team when things didn’t go as expected?
I don’t know. There was a moment in the middle when it felt like everything was falling apart. Meeting online was very hard, and it was difficult to get everyone together anyway. I used Gathertown to have virtual drop-in rooms, which I think helped. We did feel like a team and wanted to work together, and during the rush we were all working on the same thing. Checking with Jarrik towards the end was very helpful as he was very encouraging and remarked on the project’s practicality and usefulness for researchers. Getting that feedback motivated us to improve the platform, and it felt good to know that we have a practical product. We were also kind of competitive and wanted to win [laughs].
What kind of tips would you give to a team doing their first hackathon?
Be open and ask questions. That was something that worked well for us. Sometimes people ask questions a bit too late. Reaching out to the expert pool earlier also would have been great. Because everyone has questions, and we need to address them to move forward. Be proactive. It is better to have five ideas and only one works than to not have any at all. Try to find something you would like to do. Even just observing something you don’t feel equipped to do, follow along and see what you can learn from your teammates. Use your strengths. Have fun! You shouldn’t approach a hackathon like an exam (but you do still have to be committed).
For team leaders, I learned a lot about how to organize groups. I would remind myself to notice when things don’t work. I had an idea in my head and then when it stopped working, I got stuck and had to become flexible quickly. And I thought I was flexible, but I realized I really wasn’t [laughs].
Any further comments?
I really enjoyed it. I didn’t realize it could be so fun! I really liked the sense of relevance for the library and seeing the datasets be used. I really appreciate the team of organizers and hope we do it again. I would love to be a team leader again, or maybe a team member. I would love to see the Magister Dixit collection used again, and we’re hoping to organize a way to discuss the other approaches and to further develop the database our team built. I think a lot of the projects have a lot of potential.
Kurzbericht zum Hackathon ‘KI und Mittelalter’
[Aktuelles] On site-workshop – Hackathon 05.02.2024
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Annelore Knoors, BiblioTech Hackathon participant and student in the Advanced Master in Digital Humanities. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Annelore’s group, the ChaoTech Warriors, worked on the wartime posters dataset featuring proclamations issued by the German General Government in Belgium during World War I. You can learn more about the ChaoTech Warrior’s project by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

Members of the Chaotic Warriors team pose for a playful team photo.
What first interested you in the hackathon? Have you done one before? What is your background?
I hadn’t done one before, but since the DH Master only lasts one year I wanted to take the opportunity to learn more, especially since there are so many different types of analysis and approaches. Also, the interdisciplinarity of the hackathon was an interesting opportunity. I think my team had a historian, informatics, PhD student, and two linguistics students. Then I have a linguistics and literature background and now do DH.
Why did you choose the DH Master?
I was curious to know about the recent developments within my field, and other fields since DH allows you to move beyond your initial field. This happened for me as I did the linguistics program. There are some classes that are for all of the Faculty of Arts which have to do with data and those really got my attention.
What were your expectations of the hackathon before beginning? Did you feel like you would be well-equipped for the project?
I didn’t think I would have enough skills to contribute to the team. Especially since the hackathon started in March, so I only had one semester of classes. But through the Introduction to DH course, I got a great basic knowledge. With technical things and analysis you always need to look at things precisely, so I knew what I had to look for. We did have to do a lot since we lost a few members, but the remaining members were able to do a lot, even without much previous knowledge.
Any concerns?
Not really. I hadn’t considered that so many people would drop out, but we managed well enough, and I think we had a great result even with the limited team members.
What was the group brainstorming process like for this project?
First, we noticed that people in the team had different approaches to the theme in relation to their profiles. So we really used our imaginations and were able to think non-traditonally. In the end we found a combination of ideas.
How did you establish your methodology and approach to the dataset? Were you inspired by any other platforms or projects?
I did the topic modeling, which I had already done for the Introduction to DH class when we did different titles of journals. I did topic models on the titles and institutions’ names. So that was nice because I could go in and only had to change a few things. We also made a timeline, which I had experience doing for an art history course, so I knew the platform was accessible and good for less technical users to use.
What was your role in the project and how was it different/in line with what you were expecting?
On a conceptual level, I thought I would be a beginner DH’er and people would have to teach me things. But I was more of the expert DH person. I could really teach the other team members like the informatics students with no background in humanities, and the others who didn’t know a lot about the digital side. So it was nice to be able to share what I had been learning for the past six or seven months. They told me they really liked learning that way.
What was the most challenging part of the hackathon?
I don’t think there was anything that was that big of a hurdle, honestly. I think the most difficult thing was the OCR that we wanted to correct; the layout wasn’t right. The timing of the presentation also meant that we had to tone down our ideas. Maybe if we had a bigger group we could have tackled a bigger project, but we were limited.
Was there any main motivator/goal that encouraged the team when things didn’t go as expected?
We basically got a lot of confidence in knowing that we can just go for it.
What kind of advice would you give to a team doing their first hackathon?
I think in the starting phase, don’t be afraid to sign up. Even if you don’t have any experience in the field, it’s a great opportunity to learn about data and meet people from other programs. You also have a team leader and the experts to help. It’s a great time to learn, and you get a lot of help.
Also, if you’re considering doing the DH Master’s, it’s a great opportunity to get to know the people in the DH community. The community is really strong. You get to know a lot of people, talk to them at the reception, etc. It’s great to get to know the people and the subject matter.
What kind of advice would you give to someone from the DH Master’s?
I would tell them that they will learn a lot from the expereince, and that they shouldn’t underestimate themselves.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Tom Gheldof, BiblioTech Hackathon group leader. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Tom’s group, called the ChaoTech Warriors, worked on the wartime posters dataset, featuring proclamations issued by the German General Government in Belgium during World War I. You can learn more about the ChaoTech Warrior’s project by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

The ChaoTech Warriors team show their warrior stances during the closing event of the hackathon. On the right, Tom animatedly delivers the presentation of his team’s project at the closing event.
What first interested you in the hackathon? Have you done one before? What is your background?
I am an ancient historian by training and have a degree in journalism and cultural studies, after which I followed a training in what is now called digital humanities. When I started working as a researcher, I realized I could use my programming and digital skills more than my scientific skills. So when a position opened up with DARIAH (a European consortium for Digital Research Infrastructure for the Arts and Humanities), they were looking for someone with a balanced profile. I’ve always liked to balance digital tools and methods, even 10 years ago, so it’s a personal and professional interest. Now I am the day-to-day coordinator for CLARIAH-VL – the collaboration of DARIAH and CLARIN in Flanders. I’m mainly involved in helping researchers working in digital humanities where possible by using my main interests: historical databases and linked open data.
I can’t remember how I first got included in the hackathon [laughs]. The organizers had already planned that I would be participating, but through more casual meetings, the question came up whether I would be interested to take part as an expert or as a team leader. I had already participated in a hackathon at a conference before, but it was a lot more condensed. I enjoyed that hackathon in a way that I hadn’t expected. Mainly there were a lot of researchers, and it was the first time I worked in a small group with [people from] such different backgrounds. We worked with our own materials, which made it more ambitious, yet a lot more straightforward. With two days, you get right into it and have to keep to a timeline, making it a great experience.
What was your primary concern when beginning the project? Interface? Usability? What kind of audience did you have in mind?
My primary concern was whether everyone would show up [laughs]. I also had to prepare myself beforehand, because even though I am used to being a coordinator, I didn’t know what the background or level of enthusiasm of the group would be like. Also, the wartime posters corpus was beyond my familiarity and comfort zone as a researcher, so I wanted to familiarize myself. I began by exploring the corpus on the digital platform and had a look at the metadata. I looked at it as if I were a contestant to see what it would be like, then I decided to let the team give their ideas first.
How did you establish your methodology and approach to the data set? Were you inspired by any other platforms or projects?
The first thing I noticed was the quality of the digitization was not 100%. While most of the automatic translations were okay to read, we encountered problems with the metadata. I reflected on my past experiences with some other digitization projects, services, and tools that might have the potential to improve the translations. I asked my group members which ones they were familiar with and how they would approach such a challenging corpus. In their answers, it became clear that the members brought valuable backgrounds to the table, particularly from fields like linguistics, Natural Language Processing (NLP), and philosophy. The first gathering during the “Meet the Data, Meet the People” event was very exploratory. During that meeting, we just brainstormed about potential tools and methods.
How did the workflow and work distribution change once some group members could no longer participate?
I should disclaim that our group was struck with a bit of misfortune. Two of the eight group members cancelled their participation, and they were the ones who knew the most about the ManGO platform and the High Performance Computing infrastructure (HPC). Additionally, one of the more technically skilled members—who followed the explanation of the platform—also cancelled further on. So out of the 8, only 4 participated over the full 10 days. But even with a small group, we were motivated. I took encouragement from their motivation and expertise. I tried to encourage them to go beyond their comfort zones to explore the other tools that they might not have considered at first. This was very positive, but also a lot more work for each of us. In the end, we overcame the challenges, but we did have to downsize our ambitions because of the workflow. All in all, I think the members were pleased with the outcomes and with being able to build up their skills by using tools they hadn’t used before.
In the poster it says there were advantages and disadvantages to having a small corpus. What were those?
Our corpus was composed of 171 posters, but we were able to familiarize ourselves with it very quickly. We noted that it was mostly multilingual, and asked ourselves if we could do anything with that. And regarding the metadata: can we scrape it for a multilingual use? One of the disadvantages was that we couldn’t use some of the more ambitious tools. So, for example, exploring the use of the HPC infrastructure wouldn’t be of much help because of the feasibility, and our laptops weren’t able to do those computing tasks.
Was there any main motivator/goal that encouraged the team when things didn’t go as expected?
I must give credit to my team members for pursuing their personal interests. A lot of the members entered the hackathon not knowing if they wanted to do a research position or do the Advanced Masters of Digital Humanities. For them, it was great to experience if they would want to pursue this further. Speaking for my group as whole, the more you delve into the research corpus, the more you can see what can come out of it. Despite not having a lot of experience in the subject area, I was impressed by the language, content, formation, and layout of the posters as we explored a wide variety of research questions.
What kind of tips would you give to a team leader participating in their first hackathon?
I would again give the group as much responsibility as possible. You don’t need to worry, as a team leader, if they have the skills or digital competence, they will delve into it out of their own interest. By giving them responsibility, they might start exploring tools which they would have easily given to someone else. Don’t underestimate the creativity of the team members. At our brainstorming session, we ended with a variety of approaches after talking for only one hour. Out of their creativity came the most interesting research questions. I also benefited from how the main organizers said to not be afraid to reach out to the experts. We waited too long to do that. We had a great pool of experts from the library, Faculty of Arts, ICT department, PhD researchers, etc., and I was surprised to learn how wide their support reached. I think we might have benefitted more if we had reached out to them at the beginning. I will admit, I did want the team members to have full responsibility first so that they wouldn’t immediately reach out to the experts, but it’s a double-edged sword. Still, an expert could have made up for a missing team member. And according to feedback from my team, they really enjoyed the collective motivation to continue and the learning of new tools and methods that they applied.
Final Remarks
Thank you to the organizers for the flawless organization. It was a great balance of personal and online meetings, and, as a DH event, it worked really well. I really enjoyed the final event and was pleased with the quality of the posters, including our own. So for myself, it was a very pleasant experience, and I would not hesitate to be a candidate again as a team leader or even as a team member, just to see how fun that can be.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Sandra Elpers, BiblioTech Hackathon participant and Bachelor student in Theology. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Sandra‘s group, the Illuminators, worked on the Bible of Anjou dataset. The Bible of Anjou is one of the most important and valuable pieces in the collection of KU Leuven’s Maurits Sabbe Library, the library of the Faculty of Theology and Religious Studies. The Bible of Anjou dates to 1340 and is a unique and beautiful illuminated manuscript created at the Royal Court of Naples. To learn more about the manuscript and The Illuminators’ hackathon project, have a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

Sandra presents the Illuminators’ final project during the closing event of the hackathon. On the right, the Illuminators pose for a team photo.
What first interested you in the hackathon? Have you done one before? Can you tell us more about your background?
This was my first hackathon and what interested me were the datasets. I also liked the fact that participating in the hackathon was available to all students. Since I’m only a bachelor’s student, it was a nice way to get launched into the program; originally I started studying French and Latin, but then I switched over to theology. For the hackathon, I was interested in an academic sense, but I also thought it was a good opportunity to learn new skills.
What was your primary concern when beginning the project?
Maybe my lack of technical skills. I found myself in a group with people with a lot of skills already and didn’t really know where were we going with it. But this worry was resolved quickly by just trusting the others in my team. We lifted each other up.
What most excited you about working with this dataset?
The beauty of the object. As an illuminated manuscript, it is aesthetically very beautiful.
Were you familiar with the Bible of Anjou before the hackathon?
I was not familiar with the Bible of Anjou itself. I was familiar with the Bible and Bible translations, but for this specific piece, I had only heard of the name and knew it is a big deal in the Maurits Sabbe Library.
What was the brainstorming process like for this project?
We had a lot of ideas come up originally; stuff based on the images and the text, but the text and the potential of OCR was more uncertain. With the images, though, we knew that with the animals we could make a sort of “medieval zoo.” Looking at the illustrations of musical instruments and categorizing the illustrations were some of the other immediate ideas.
What was your primary audience for this project?
I don’t think we really had an audience in mind when developing the project… it was more general. We just wanted to make the images work.
Were you inspired by any other platforms or projects?
We weren’t really inspired by anything specifically. One of the main aspects of our project involved creating GIFs of the illustrations in the manuscript. For this, we just came up with the idea and then researched how to do it. Researching the options told us it would be possible, and we just wanted to do it.
How were you able to apply knowledge from your studies to the project?
I was mostly learning as I went. But for the poster presentation, we did look into the historical context for the Bible of Anjou, and I saw some stuff that I already knew from my studies. So there was a bit of overlap. One interesting takeaway: I went to the library and had a great talk about the book and its content and the different locations it had been stored in. This helped us to be able to also contextualize its history.
Was your role in the project different or in line with what you were expecting?
My expectations were different. I tried to not have too many since it was my first hackathon. The experience turned out to be mostly in line with what I thought it could be. Since my technical skills were limited, I figured I’d have to learn some new things, and since I’m strong with presentations, it wasn’t too surprising that I ended up presenting our project during the closing event.
What kind of advice would you give to a team doing their first hackathon?
Try to be flexible and available. It was nice having 10 days and not having to meet every day, but it was necessary to sometimes see everyone. The Microsoft Teams environment was also very important to attend the meetings online. The organization within the chat on Teams was good, so there was a good stream of communication.
What kind of advice would you give to someone in your field, specifically?
I would tell them that they definitely have something to offer to the team. If not the technical skills, then it’s at least a way to provide more perspective. We all had different ideas and brought new ideas and perspectives.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Heike Pauli, BiblioTech Hackathon participant and MA student in linguistics. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Heike’s group, the Digital Peripatetics, worked on the Lovaniensia dataset. Lovaniensia comprises the old academic collection, featuring the work published by members of the Old University of Leuven and output linked to the university spanning the period between 1425 and 1797. You can learn more about the team’s project by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.

Heike delivers her team’s project presentation at the closing event of the hackathon. On the right, three members of the Digital Peripatetics team pose for a team picture.
What first interested you in the hackathon? Have you done one before? What is your background?
I had not done one before. My initial interest was because it was an opportunity to combine my passion for Latin, Greek, and AI. You don’t see that overlap often. So before the hackathon, I hadn’t seen a place where I could combine my overlapping interests in AI and the classics. This was a unique opportunity for that. Then a few of my professors mentioned it, and I continued hearing about it–probably like three times–and took it as a sign [laughs].
What was your primary concern when beginning the project?
I would say, for me, the OCR. But that was a surprising concern. We had to skip it entirely due to the quality and the number of errors. Apart from that we didn’t have a lot of troubles. And once we established that the OCR would be a challenge, we all just worked together. It was a missed opportunity because there was a lot we could have done and a lot available, but since we decided to look at the metadata and not the OCR it was a bit restricting. I would have liked to look at the text more. There is still a lot of room to explore this data.
What was your primary audience for this project?
Our target audience was mostly people interested in the Old University. Our group leader was a prime example of the type of person we were catering to. We tried to look at the university itself: what was being created, who was there. So, the target audience was basically the people who were in the group [laughs]. It was almost like we were creating a resource for ourselves.
How did you establish your methodology and approach to the dataset? Were you inspired by any other platforms or projects?
Indeed, I think some people were inspired by past projects. We just had to start with the metadata. We did the usual stuff like making a word cloud and other things you would typically do for Natural Language Processing (NLP); it was pretty exploratory. During the introduction at the “Meet the Data, Meet the People” event, the explanation of metadata was also really helpful. And it just started with the basic approach, and then, from there, we knew where to go. We used the tools we already had. I was at first disappointed that we couldn’t make a crazy new tool or code, but it’s challenging to do that in 10 days.
What was the brainstorming process like for this project?
We had a lot of ideas at first. At “Meet the Data, Meet the People” everyone was writing all over the brainstorming paper. Then we had to tone it down to make it coherent and achievable. We picked three or four tasks, which was only the tip of the iceberg. We had to think about what was manageable, so the different analyses were the best approach.
What was your role in the project and how was it different/in line with what you were expecting?
I was expecting that I’d need to program a lot, but I’m not an expert or anything. So it wasn’t necessary; we found that a lot of tools exist which could help with that. I focused more on the language. It was nice because we all had a place where we could best apply our skills. Some people were really great at social network analysis and worked on that, and my job was to work on, or maybe get mad at, the OCR [laughs]. Sometimes it was really accurate and helpful and then other times there were errors. At the end, I found my place in the language analysis side. So in looking at my background, the Natural Language Processing was an obvious approach. And I also loved presenting the project at the closing event. We divided the tasks according to what people wanted to do. At first it was hard to organize it that way, but it helped make it coherent and a singular project.
How did you use your academic experience to help with the hackathon?
The easiest part was the Latin. We were the most academically prepared for this. For me, it also helped to see that I could do some useful stuff with my interest in computational linguistics; I already knew a bit of Python, so it was nice to apply my DH skills and see that they are useful.
Was there any main motivator/goal that encouraged the team when things didn’t go as expected?
It helped that we all had a background and were interested in the dataset ourselves, so that was our main encouragement. It was within our field and we all had a common love for the material.
What kind of advice would you give to a team doing their first hackathon?
Just do it! Don’t overthink it. Even though there was a prize, the main interest was to get knowledge and to explore the data. We didn’t worry too much, honestly. Sometimes there can be frustrations or things won’t work, but those aren’t necessarily failures and you can explore new approaches.
What kind of advice would you give to someone in your field, specifically?
You can bring a new perspective even if you don’t know about technology. You can help those with a technological background if you’re open to it. There’s a stereotype for people in the Classics that we don’t know about computers and only care about books [laughs], but you don’t have to become a professional programmer during or after the hackathon. The knowledge you gain could be helpful for research, your career, anything. It’s not just for people who know how to program. It’s an option for everyone. Blur the lines.
Final comments:
I found it very useful to have a chance to explore all the fields I’m interested in. I would consider participating again. The 10 days was also a good timeframe in the grand scheme of things.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and André Davids, BiblioTech Hackathon participant and employee at KU Leuven Libraries Economics and Business. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” André’s group, the Demographic Dynamo, worked on the historical census dataset. You can learn more about the team’s project by having a look at their project poster in the BiblioTech Zenodo community. To read more about the hackathon and the results, you can visit the BiblioTech website.
André Davids and Stijn Carpentier, two members of the Demographic Dynamo team, receiving their prize from Demmy Verbeke, jury member.
What first interested you in the hackathon? Have you done one before? What is your background?
It was my first hackathon. I saw an advertisement about it and thought it looked interesting, but I didn’t subscribe because I didn’t have a technical background. I was convinced by Nele Gabriëls who enthusiastically encouraged me to join. As I provided one of the datasets, industrial countings from the 19th century in Belgium, I was already familiar with some of the data. At the time this dataset was created, Belgium had a top class statistician, Adolphe Quetelet, and the Belgian statistics were the first that were open to the public and for research. The Belgians were the first to make this type of data available for research. It became a model for many other countries. This is also why it’s interesting to other countries to see the Belgian countings. So I’ve been converting these over to Excel files and universities often contact me regarding this dataset. I wanted to see the potential up close of using a dataset like this for researchers from Belgium and abroad.
I started working at KU Leuven Libraries in January 2000. It was mostly by coincidence. I graduated with a degree in Information and Library Science and then came to Leuven to fully learn Dutch. By coincidence, I found a job at KU Leuven; it was actually my first job interview. My first task was to be at the information desk, and then I switched to cataloging and then acquisition. Now I do digitization. So, I’ve been here for 23 years, but my job is always changing; there was never a day when I didn’t want to come to work.
What was your primary concern when beginning the project?
My primary concern before seeing who was in my group was that our group wouldn’t have enough technical skills to do something nice. My group was made up of mostly researchers, and I saw very quickly that I was a bit ad hoc and that they knew a lot. We started on our project quite late. At the beginning, we weren’t sure what to do with the data and had to make a selection. I was already very familiar with the data, so I helped them select the tables. 5 days after we received our datasets, we had a meeting and decided what to do with it. At the beginning, I was a bit stressed because time was passing and we didn’t have a concrete goal. After that meeting though, we found direction. So even though we were starting late, the technical members were very efficient.
How did you establish your methodology and approach to the data set? Were you inspired by any other platforms or projects?
Nope. The teammate who came up with the idea we went with had a dynamic map of Belgium, so she said we could adapt that. Then we looked at the data to see what could be used for the map. We saw that we had tables that showed internal Belgian migration. By putting this on a map, you could watch how people moved from one place, such as Brussels, to another. It was actually quite impressive to me.
What was the brainstorming process like for this project?
At the “Meet the Data, Meet the People” event, we actually spent most of the time coming up with a name [laughs]. We didn’t have a lot of ideas at the first meeting. And it took a few days to find that direction. But since I knew the data, I was able to share my knowledge with the team.
Were there any obstacles the team had to overcome?
Well, some of the members were absent because they were attending conferences. The people who were the strongest technically were not able to be at the closing event where we presented the project. I could speak mostly to the data, but luckily Stijn volunteered to present since he already does that pretty often. I told him he should be a salesman [laughs]. We were worried that there would be more technical questions. Neither of us could really speak to the code, but luckily the questions didn’t go there.
What was your role in the project and how was it different/in line with what you were expecting?
I don’t know what I expected. I had no idea what the outcome would be. I was pleasantly surprised that we had a dynamic product. But given the time frame, I wasn’t expecting to do something like that. Still, it was nice to see how quickly people with a technical background can do something.
What advice would you give to someone who is hesitant to participate in a hackathon due to their background?
I was initially afraid since I don’t have a lot of technical skills. But now I would say that everyone has something that they can bring. In my case, I knew the data and was willing to put a lot of time into the project. But if someone mentioned to me that they didn’t want to participate because of their technical skills, I would just say “[it] doesn’t matter. Other people will have the technical skills.” Just the fact that someone is willing to participate means that they will have ideas to bring and can contribute in their own way.
Did you apply any skills from your career to the project? Did you take away any knowledge you can now use for your job?
I didn’t really get new technical skills since I didn’t really have any preliminary technical knowledge. When we started working on the industrial counting project, though, we were focused on the numbers, not the categories. Eventually, as a researcher, I decided to start working on the categories. So I started using color codes for the tables and when doing this project, my teammate said that this actually helped him a lot with the code. So now, in the context of my work, I make sure to use the same colors since it will help the people working on the data in the future.
Also, last week, two researchers came to me and were asking for new volumes. I showed them our dynamic map because even though they need the volumes, showing our map shows them the possibilities of the industrial data.
What are some benefits of a hackathon for someone with a career in the GLAM sector?
It was a nice experience. Because we won the hackathon it was nice to have that, as well. It was nice to see that the data we had been working on could produce a winning project. But even if we hadn’t won, I would have come away with a good feeling. Even though I’m quite competitive [laughs], in this case, I was just happy to have something to show. The census data is usable for a ton of different fields and faculties and there was already interest in the data. So there’s still a lot of potential in the development of that. And it’s nice to see that people are interested in the data and what we generated even before this project.
What kind of tips, as the winning team, would you give to a team doing their first hackathon?
I feel that, in this case, winning was not our main objective. If we had more time, we could have done a lot. Just have an enjoyable experience and don’t think too much about winning. I’m not even sure if we would have had a better project if we had wanted to win in the first place. I don’t think we would have because of the pressure. But even if I could restart and pick a team, I would still work with this team and this dataset.
Interview Series: In Conversation with BiblioTech Hackathon Participants
The following is an interview between Alisa Grishin, Artes Research intern 2022-2023, and Fien Messens, BiblioTech Hackathon participant. The hackathon took place in March 2023. It was a 10-day event and included a pre-hackathon orientation moment called “Meet the Data, Meet the People.” Fien’s group, “God Save the Tweets!,” worked on the contemporary news media dataset featuring tweets including the hashtags #queueforthequeen, #abolishthemonarchy, and #queenelizabeth during a short span of time around the death of Queen Elizabeth II in September 2022. You can learn more about the team’s work by having a look at their project poster in the BiblioTech Zenodo community. You can also find data related to the technical aspects of the project in the God Save the Tweets! GitHub repository. To read more about the hackathon, view the full photo album, and discover the teams’ results, you can visit the BiblioTech website.

What first interested you in the hackathon? Have you done one before? What is your background?
When I first learned about the hackathon, I was immediately drawn to the opportunity of tackling a challenging problem within a limited time frame and collaborating with a team of individuals with specific expertise from a variety of backgrounds. I have a background in art history and digital humanities, especially with born digital collections in the GLAM scene. I can archive and preserve a tweet in a stable way, but have little knowledge on how to analyze the output. So this hackathon provided me with a learning platform where I could go about learning about this through trial and error.
What was your primary concern when beginning the project? Interface? Usability?
We analyzed the tweets from the first 10 days of the passing of Queen Elizabeth II. Our primary concern at the outset was the sheer volume of the data. We were confronted with a vast corpus comprising multiple languages, but we swiftly narrowed it down to English only. Despite this refinement, we still faced the challenge of dealing with a substantial dataset of 300,000 posts. Our main objective became comprehending and effectively managing this extensive stream of tweets by employing suitable methodologies.
What kind of audience did you have in mind?
That’s a tricky question.
The project posed a complex challenge as we used text analysis, sentiment analysis and data mining techniques. In addition, we also created a tweet generator, utilizing our existing corpus to craft newly generated tweets. The primary objective of our tweet generator was to cater to users who sought to share opinions and partake in engaging discussions on Twitter, fostering a sense of community and/or collaboration. Our target audience encompassed not only researchers but also a wider public audience, aiming to provide a platform for diverse individuals to express themselves and facilitate meaningful (or not so meaningful) interactions. In general, we were aiming for a broad public, not just researchers.
How did you establish your methodology and approach to the data set? Were you inspired by any other platforms or projects?
The methodology revolved around text analysis, also known as text mining. To ensure relevance, we filtered the English language tweets using specific keywords and then employed sentiment analysis techniques to analyze the data. In addition to drawing inspiration from other sentiment analysis projects, we benefited from the expertise of one team member who had previously explored sentiment analysis methodologies during a digital humanities course.
When it came to the tweet generator, we found inspiration in various existing tweet bots. We especially liked the @TorfsBot tweet bot, from the old rector of KU Leuven, which had publicly available source code that proved immensely helpful. While we drew insights from existing bots, we primarily relied on our own experiences and creatively adapted concepts from other tweet bots to shape our unique implementation.
Why did you choose to do the most active users as opposed to a larger pool? Was it just to downsize the data size or was there another reason?
We focused on the most active users because it helped us get a grasp on the users and those who were actively participating in online discussions. We observed an occurrence of overfishing in specific hashtags, where certain users exploited the hashtags to promote unrelated content. So, by looking at the top 10 or so active users, it allowed us to better understand the overall sentiments surrounding the Queen’s passing. This approach aided us in filtering out the excessive use of certain hashtags and reducing their impact. It allowed us to focus on the overall sentiment analysis without being influenced by the promotion of unrelated tweets or ideas that were going viral at the time.
How did you classify words/users on a -1.00 to 1.00 scale?
We utilized a sentiment analysis algorithm that employed a numerical scale ranging from -1.00 to 1.00. A score closer to -1.00 indicated a negative sentiment, while a score closer to 1.00 indicated a positive sentiment. The algorithm took into consideration various linguistic and contextual factors, such as keywords, to determine the polarity of sentiments expressed in the text. It is important to highlight that one of our team members had expertise in sentiment analysis, which significantly enriched our project.
Given the size of our team, we were able to allocate tasks efficiently. I primarily focused on developing the website, while other team members handled tasks such as presentation creation, analysis, and more. This division of responsibilities within the team proved to be highly beneficial, ensuring a smooth workflow and allowing each member to contribute their expertise to the project.
Was there any main motivator/goal that encouraged the team when things didn’t go as expected?
Digital humanities is all about embracing the adventure of trial and error, and boy, did we dive headfirst into it! Our incredible team leader, Leen Sevens, was our ultimate cheerleader, always keeping our spirits high with her GIFs and emojis. But it didn’t stop there – the organizing team of the hackathon also rocked our chat with their infectious motivation. Knowing that we had this amazing support system, cheering us on and having our backs throughout the process, made the journey all the more exciting and rewarding. It’s safe to say we had a real dream team!
What were some of the roadblocks you faced?
Our initial plan was to generate tweets for the tweet generator in the backend, but it proved to be more complex than anticipated. Not only would it have required purchasing additional storage, but it simply didn’t work out as intended. Thankfully, we had a solid plan B in place. We resorted to manually creating the tweets for our generator and seamlessly integrating them into the HTML and Java Script system.
Timing also presented its own challenges. With numerous tasks and responsibilities on our plate, the 10-day timeframe seemed like a whirlwind. However, there was a silver lining to this intense rush. It brought the entire team together, all of us uniting our efforts to meet the deadline. In a way, it fostered a sense of community among us, reinforcing our shared commitment to achieving our common goal.
What kind of tips would you give to a team doing their first hackathon?
I’ve already written some down [laughs].
- Plan, divide, and conquer. Let folks explore what tickles their brain and make it a learning experience for everyone.
- Communication is key. Regularly touch base, share ideas, and foster an environment of open dialogue.
- Embrace the beautiful chaos of errors, because that’s where the magic happens!
- Have fun and enjoy the process. It’s a great opportunity to learn from each other and work towards one shared goal.
Results: BiblioTech Hackathon
During March 2023, the first ever BiblioTech Hackathon took place! This initiative was jointly organized by KU Leuven Libraries and the Faculty of Arts. Over the course of 10 days, students, researchers, and KU Leuven staff came together to brainstorm and carry out a digital project that would valorize selected datasets (see the description on the BiblioTech website) from the library’s digitized collections. The end results far exceeded expectations. The groups proved to be self-sufficient, cooperative, and efficient, which resulted in seven extremely successful projects. Consequently, the judges at the closing ceremony thought it would be only right to recognize each team for their respective merits. The teams, their projects, and their received awards are as follows:
Best Reuse Potential
The Digital Peripatetics: Tom Bellens, Yanne Broux, Lydia Janssen, Sunkyu Lee, Heike Pauli, Laura Soffiantini & Manou Vermeire
Dataset: Lovaniensia
The project “Walk Like a Lovainiensis”: Profiling the Early Modern Homo Academicus through the Lovaniensia Dataset used social network analysis, OCR, and language analysis to illustrate the academic environment during the early modern period. Looking at theme, language and overall network at this time period, they synthesized data and determined the functionality of the dataset. They concluded that while they were able to pinpoint commonalities, a broad picture was unattainable as there was significant variability, and perhaps bias, within the data.
The Digital Peripatetics ‘project poster in the BiblioTech Zenodo community.
The Poststars: Tim Van de Cruys, Ivania Nadine Donoso Guzmán, Benoît Crucifix Lucia Allende Garrido, Peiyang Huo, Ina Lamllari, Iva Stojevic, Yining Zhu, Dawn Zhuang
Dataset: Vintage Postcards
The project Echoes of Yesterday: An Exploration of Vintage Postcards used geocoding, exploratory data analysis & visualization, and topical exploration of data to create a navigable and searchable website for the postcards. Of the 60,000 Belgian postcards in the dataset, 30,000 were used for the database. These postcards were then organized and clustered via the CLIP model so that the dataset could be easily searched by the user.
The Poststars’ project poster in the BiblioTech Zenodo community.
Best Presentation:
The Illuminators: Austine J. Crasta, Sandra Elpers, Suzanna Cuypers, Courtney Van de Mosselaer, Agni Vourtsi, Joachim, Bovin, Ni Li, Yun Liu
Dataset: Bible of Anjou
This project sought to highlight the miniature illustrations in the pages’ margins. After analyzing the textual and graphic elements of the Bible, the team compiled the most visually interesting and usable images and made them publicly available on the team’s website. GIFs were similarly produced to disseminate these images and the implementation of these gifs into the presentation helped to earn the team the “Best Presentation” award. A font alphabet was also created to further increase the usability of the graphics.
The Illuminators’ project poster in the BiblioTech Zenodo community.
Best Gimmick and People’s Favorite:
StudentsBlock: Daria Kondakova, Jarrik Van Der Biest, Linde Van den Eede, Lode Moens, Rossana Scebba, Haija Chen, Daniel Oltean, Thomas Cole, Jiaqi Zhu
Dataset: Magister Dixit
This project extracted data from the Magister Dixit to create a searchable database. Parsing the MARC21 XML of the Magister Dixit, the team pulled lecture subjects, professors, students, and dates. The database, which employs the Latin terms in cooperation with the original documents, is searchable by each of these elements. With an overarching goal of contextualizing the dataset to provide further insight into the academic structure of the old University of Leuven, the website elaborates on the limited previous understandings of academic life. It was the project’s creative whimsy, however, that helped the team win the “Best Gimmick” prize. The website reuses the style of Toledo, the digital platform for KU Leuven students, to create a playful digital learning environment representing a “day in the life” of a student from the old university.
The StudentsBlock’s project poster in the BiblioTech Zenodo community.
Best Survivors:
ChaoTech Warriors: Annelore Knoors, Tom Gheldof, Ujjayanta Bhaumik, Catarina Arnaud Boleto, Fatemeh Mirkazemiyan
Dataset: WWI Posters
The ChaoTech Warriors were assigned the Wartime Posters, a corpus of 171 (multilingual) proclamations issued by the German General Government in Belgium during World War I. All posters are available in a digitized image format with metadata descriptions to categorize them by year of publication, location, topic, etc. Each image is accompanied by an OCR text, produced on a line-by-line basis, rather than following the column division that distinguishes the layout of each poster. As the posters had French and German translations for the Dutch texts, the team generated also English translations. A timeline and AI poster generator were also created as interactive elements on the website.
The ChaoTech Warriors’ project poster in the BiblioTech Zenodo community.
Best Public Appeal:
God Save the Tweets: Fien Messens, Elisa Nelissen, Shirin Izadpanah, Azmi Boonmalert, Sara Zanetti, Anna Sofia Churchill, Area Maria Guede Ramos, Yuqi Zhu, Leen Sevens
Dataset: Tweets from the death of Queen Elizabeth II
Spill the Tweeat
This project dealt with the contemporary dataset of tweets written following the death of Queen Elizabeth II. Analyzing thousands of tweets using sentiment analysis technique, Spill the Tweeat visualized their findings by creating an interactive platform that generated pro-monarchist and anti-monarchist tweets. Additionally, they were able to create a timeline that highlighted the predominant sentiments exhibited through the tweets, such as tracking the level of sympathy expressed and their corresponding events.
God Save the Tweets’ project poster in the BiblioTech Zenodo community.
First Prize:
Demographic Dynamo: Stijn Carpentier, Ate Poorthuis, Gertjan Muyters, André Davids, Céline van Migerode, Olena Holubowska, Anirudh Govind
Dataset: Census Records
Census Data Sandbox: From Numbers to Narratives
This project sought to visualize census data so that it might be utilized by a variety of audiences. Because of the amount of information the census provides, Demographic Dynamo’s first priority was to make the information accessible, rather than synthesize or analyze the data. Using JavaScript, they created an interactive platform that connects the census data to a map. Their goal was to equip researchers with the ability to determine how useful a given census can be for research. The tool was created with the potential for reuse and transferability in mind so that it might be applied to other censuses, as well.
The Demographic Dynamos’ project poster in the BiblioTech Zenodo community.
The event was not only a fun ten days, but it also produced seven successful DH projects. On top of this, the participants had the opportunity to learn new DH skills and to meet new people in the KU Leuven network, sparking friendships and potential for future collaboration. Head over to the KU Leuven BiblioTech Hackathon Zenodo community to see all of the project posters!
Internship Introduction: Hackathons and Promoting Cultural Heritage Materials
My name is Alisa Grishin and I am currently serving as one of the 2022-2023 Artes Research Interns. Currently a Master’s student of Cultural Studies at KU Leuven, my main role will be to support the organization of the BiblioTech Hackathon. For this, I will research the potential that hackathons hold for increasing visibility and fostering engagement with cultural heritage materials. One of the main outputs of this research will be an interactive map of existing hackathons in the academic context and will place the BiblioTech hackathon on this map. I will also be developing a handbook for the organization of future iterations of the BiblioTech Hackathon. Further, I will contribute to the Scholarly Tales blog, enriching both my own knowledge of Digital Humanities and also the collective knowledge the blog offers to readers.

Alisa is expected to graduate in the Summer of 2023. She previously received a B.A. in History with a concentration in Public History and minors in Political Science and Legal Studies from Salem State University.
Prior to coming to Belgium, I grew up and studied in Salem, Massachusetts. With this came exposure to difficult historical reconciliations and complicated understandings of local heritage. As a child, I would regularly act as an afflicted child in Salem Witch Trials documentaries. While at the time it was a fun way to get out of school, eventually there was a certain level of appreciation for my small role in bringing attention to this oft-misconstrued chapter in my nation’s history. Thus began an interest in public history and promoting access to cultural heritage.
Now with a background working at local museums, an art law nonprofit, and other nonprofits in the arts sector, I have grown especially interested in the use of policy to help preserve personal and collective cultural heritage. Greatly attuned to narratives and biases in history, I have found that fair access to (in)tangible heritage and encouraging cross-cultural discourse is instrumental to the development of protection policies. In other words, in order to ensure that local and state governments can preserve heritage-linked places, things, and ideas, we, as students and researchers in the cultural sector, need to do our part to make these elements available to the general public.
Having previously studied history, my relationship with historical materials has always been quite tangible. Although digital heritage is on the radar of many historians, There is still much room for the application of digital humanities methods and tools beyond digitization and the publication of online collections. As technology advances and the world becomes more digital, the potential the application of digital humanities tools and methods holds in the cultural heritage sector only continues to increase.
Museums, libraries, and other cultural institutions often have spent dozens — if not hundreds — of years to build collections, audiences, and reputations. Despite this established history, these institutions must engage with current digital developments to maintain their relevance and impact for the future. The increased access to digitized materials or electronic editions means that cultural institutions must adapt and evolve. This expansion does not just mean that they can no longer rely on the in-person visitation they had in the past; it also means that these institutions have an obligation to meet their audience on the patrons’ own terms. The objectives of digital scholarship help fill this gap — expanding Open Access, improving informational literacy, and digitizing and visualizing collections are just a few examples of the ways Digital Humanities can work with cultural institutions.
Hackathons are a way to extend the internal missions and goals of cultural institutions. Libraries, in particular, can benefit from organizing hackathons as a means of promoting their collections and encouraging education when it comes to their materials, but also when it comes to acquiring digital skills for engagement with and exploitation of those materials. In action, this leaves libraries as either the site of a hackathon or the subject of a hackathon; in the case of BiblioTech, the KU Leuven Libraries is both. This event is therefore an exciting opportunity for not only the libraries as they branch out into more technological initiatives, but also the general public. Events like this make accessing collections and data easier, in turn making the library more relevant and innovative to a twenty-first century researcher.
As we continue the hackathon preparation, I find myself creating mental notes of what I am most looking forward to (apart from the anticipated reward of a successful event). While I have much to learn in regard to the full potential of DH and its many applications, I am excited to observe how it can support the preservation of cultural heritage.
Do you want to know more about the BiblioTech Hackathon? You can visit the hackathon website here!
Hackathon: BiblioTech 2023 (KU Leuven)
This event is only open to KU Leuven researchers, students and staff.
In March 2023, KU Leuven Libraries , the Faculty of Arts and the Humanities and Social Sciences Group will organize a hackathon. The objectives of this initiative are two-fold: firstly, to put the library’s digital collections in the spotlight and secondly, to strengthen the digital humanities (DH) community at KU Leuven.
What is a hackathon?
It is an event that is usually organized over a short period of time where participants come together in small groups and work intensively on a creative digital project or towards some digital end. In the case of BiblioTech, KU Leuven researchers, students, or staff will be divided into small groups and will work specifically on one of the datasets prepared (by LIBIS) for the hackathon. The groups will be guided by at least one group leader and will be able to rely on the help of experts who have specific technical knowledge and skills and who will be part of the expert pool. The groups are free to follow their creative inspiration, but must apply some digital approaches or tools to the dataset to produce an end result that will be presented in the form of a short presentation and a poster at the closing event of the hackathon.
Find out more about the collections we will be working with here!
Program
The hackathon will span 10 days and will take place from Monday 13 March until Thursday 23 March. In addition to the working period of the hackathon, we will have an opening and closing event and an orientation event, “Meet the Data, Meet the People,” prior to the start of the hackathon.
- Tuesday 7 March 2023: Meet the Data, Meet the People
- Location: Zaal Couvreur, Leercentrum AGORA
- to be confirmed
- Monday 13 March – Thursday 23 March 2023: Hackathon is live!
- Location:
- Subject to preference of teams; online, physical, or hybrid
- 3 working rooms available to book in Leercentrum AGORA
- Location:
- Monday 13 March: Hackathon Kick-off Event
- Location: University Library, Van Cauwenberghzaal with reception in the gallery
- Time: 17h00-21h00
- Thursday 23 March: Hackathon Closing Event
- Location: University Library,Van Cauwenberghzaal with reception in the gallery
- Time: 16h00-21h00
Practical details
- When: from Monday 13 March until Thursday 23 March
- Where: Leuven (see above for more details)
- For whom: We welcome applications from researchers at all stages of their careers, motivated students, and also KU Leuven staff members. Digital skills are not a must, but a willingness to learn about digital approaches definitely is.
- Price: free
- Registration: Already convinced and want to take part? Great! Submit an application here. Registration closes on the 13th of January 2023.
Want to see further details? Check out our webpage here for the most current information. We look forward to hacking with you!
Dickens Letters Hackathon
6pm on Friday 3 February to 5pm on Sunday 5 February
Birkbeck (University of London)
Interested in digging into datasets? Looking to meet like-minded hackers and software enthusiasts? Have ideas for digital approaches to literary and historical data you’d like to try out? Join us for a two-day event where, working in small teams, you can develop exciting ideas using the letters of Charles Dickens.

Expressions of Interest are invited for IT professionals, programmers, hackers and digital humanists. No significant prior knowledge of Dickens is required: an introduction to the context of the letters will be provided. You will work in teams of fellow hackers with a TEI-encoded dataset of Dickens’s letters over a weekend, with the aim of producing an idea for an innovative piece of software, an app or a game based on the letters of one of the most famous writers in English.
The event will include a free tour of the Charles Dickens Museum, and a chance to work intensively with other like-minded hackers over the course of a weekend. The event is free of charge, and refreshments will be provided. Prizes will be awarded for the best app or game, together with an opportunity to take your idea forward. The Charles Dickens Museum will be open to participants to visit at their leisure on the afternoon of Friday 3 February, before the start of the hackathon. If you have any questions, please contact editor@dickensletters.com.
You can register your interest here: https://docs.google.com/forms/d/e/1FAIpQLSeSI7EiyMLbybpYcSbn-682dNgbI-HQ32lJLea_2yE3zHMGFQ/viewform
Leon Litvack & Emily Bell
Editors, the Charles Dickens Letters Project
https://dickensletters.com/
