Summer Semester 2024
Prof. Dr. Nathan Gibson
Last objective: Consider your personal goals for the semester within the big picture of digital data in religious studies.
Pick | Prepare | Process | Package |
Inputs/Sources | Structuring data | Analysis | Outputs/Presentation |
Manuscripts, Photos, Interviews | Transcribing, Collating | Textual comparison, criticism, content analysis, coding | Edition, Narrative, Thematic discussion, Interactive website |
A question for a ChatAI?
Understand plain text as a foundational type of data.
a. Text
b. Very fancy
c. Backwards
d. Emojis 🦉👀🐁❤️🐛
e. Invisible character
f. Math equation 𝓐 = 4𝛑𝑟²
g. Arabic عربيّة
h. Domino game 🁍🀱🀲🀺🁃
i. Hieroglyphics (version 1)1:
j. Hieroglyphics (version 2):
𐦐𐦗𓂅𓂧𓂋𐦉𓂂𐦇𓂂𓂑𐦓
k. Screenshot
l. Page scan
⠀⠀⠀⠀⠀⠀⠀⠀⢀⣴⣾⣿⣷⣾⣿⣿⣿⣷⣤⣤⣀⡀⠀⠀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⢀⣤⣿⣿⣷⣢⣌⣭⣍⢻⣿⡟⣽⣿⣿⣿⣦⠀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⣾⣿⠟⣷⡾⠛⠋⠉⠛⠛⠛⢷⣉⣭⣿⣿⣿⡀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⢾⣿⣿⣿⣿⠁⠀⠀⠀⠀⠀⠀⠀⣿⣷⡌⣿⣿⣷⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⣸⣿⣿⣿⡟⠀⢀⡀⠀⠀⠀⢀⣀⢹⣿⡿⢿⣿⣿⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⣿⣿⣿⣿⡇⣎⢥⣿⣷⡄⣾⣿⣭⢽⣿⡇⣾⣿⣿⡄⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⢀⣿⠻⢟⣿⠇⠈⠋⠁⢹⡀⣿⡇⠹⠟⣿⣿⣮⣤⣾⣿⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠈⢿⣷⣿⢿⣠⠀⠀⠀⣤⡄⣻⡧⠀⠀⣿⣿⡟⠿⠿⠃⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⢀⣴⣿⣿⡄⠀⠠⣤⢌⠭⣥⢀⣼⣿⣿⣿⣷⣄⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⢺⣿⣿⣿⠋⢦⡀⠐⠓⠒⣿⣿⣿⠿⣿⣿⣿⠏⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⠀⢹⣿⣿⣆⠈⠻⢶⣶⣾⡿⠟⠁⢀⣿⣿⡟⠀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⢀⣠⣴⣾⣿⣯⢿⣏⢦⡀⠀⣰⢧⡀⠀⣠⡞⣿⣿⣿⣷⣦⣄⡀⠀⠀⠀ ⠀⣤⣾⣿⣿⣿⣿⣿⣿⣷⡻⣦⡉⢿⢭⡞⣻⠿⢋⣼⣿⣿⣿⣿⣿⣿⣿⣷⣦⠀ ⣼⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣷⣿⣿⣷⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣧ ⠻⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿⠿
Text as the most important unit of human-machine interaction
Many humanistic disciplines such as history, philology, literature, theology, and anthropology primarily use textual sources.
Many humanistic disciplines also do their analysis and communicate their results using primarily text.
Text without formatting:
Plain text
Formatted text
Even crazier formatted text!
Plain text can be opened in any text program. Microsoft Word, Notepad, TextEdit, even a web browser. Try it!
You might see “.txt” on the end of a plain text file, but actually the file extension doesn’t really matter. Try it!
But plain text still has an “encoding.”
Arabic in an old encoding (Windows-1256) Try it!
Arabic in Unicode (UTF-8) Try it!
A regular expression is a sequence of characters (string of letters) that defines a text pattern.
Different programs use different standards for those patterns, but many of them work similarly and are called “RegEx.”
For example, to find numbers in a text, you don’t have to search many different times for “0” and then “1”, “2”, etc. You can just search for “\d” which represents all numbers.
Search for patterns of text and replace them with other patterns. For example,
https://docs.google.com/spreadsheets/d/1jTmHopCz8Il6tBopZlG2LfgMSEOuvMJh-Q7nzUwLkvE/edit?usp=sharing (You can experiment by copying the document for yourself or using rows 30+)
See also:
Download and install https://code.visualstudio.com/
Git Versioning
Lundström, Peter. (2020). PHARAOH.SE Available at: https://pharaoh.se/ancient-egypt/pharaoh/cleopatra-vii/ [Accessed 25 Apr. 2024]. CC-BY 4.0. ↩