This is an old revision of the document!


Neven Jovanović: Digital Humanities Interests

Department of Classical Philology, Faculty of Humanities and Social Sciences, University of Zagreb, Hrvatska / Croatia

Current

Planned

  • a school corpus of Greek and Latin texts aligned with Croatian translations, to be used in classrooms (grammar schools and universities, some 25.000 potential users); see the rationale in this document (Google Drive); some musings on Greek to Latin translation (done by Croatians) here: levenshtein-translation
  • use this corpus, together with treebanks, to produce a modern Greek - Croatian and Latin - Croatian school dictionary, with contributions by students
  • use these experiences and materials to build a large bilingual anthology of Croatian Latin writers, both as an online publication and as a book
  • adapt treebanking to make possible a language-neutral, world-wide competitions in Greek and Latin language proficiency (students produce treebanks of Greek and Latin sentences, with grammatical terminology in their own first language, but terminology is compatible across languages)
  • a set of instructions for doing research with CroALa, see samples here: croala-schola.
  • a BaseX database of Mercurius Croaticus, searchable with XQuery, to serve as a bibliography of manuscripts and printed Croatian Latin texts (cf. basex-adv)

My user pages

Other

farm3.static.flickr.com-2087-2145630043_12e9b414e4_b_d.jpg (Flickr Crepùsculo by . SantiMB .)

Texts

Bibliographies

farm1.static.flickr.com-152-424397178_05989fcfba_d.jpg (Manuscript by http://www.flickr.com/photos/carowallis1/.)

XML and XSLT-related

PhiloLogic configurations

Scripts

  • 100m.sh is a Linux shell script writing any amount of lines with numbered <milestone/> elements (and its @facs attributes!) into a file. Syntax: ./100m.sh 3 etc.
  • praznim.sh is a Linux shell script which deletes all lines from a given document. Syntax: ./praznim.sh foo.txt etc.

  • shellozov.sh is a Bash script which takes a list of (inflected) words in a file (using LWP::Simple Perl module), sends the words one by one to the LemLat lemmatizer, greps and stores the lemmata in another file, then removes the duplicates and orders the rest alphabetically. Literature: Sean M. Burke, Perl & LWP (2002; there is an internet edition as well).
  • verba.pl is a Perl script which reads a list of Latin words, one by one, sends them to the v4.pl script for querying the Perseus under Philologic parser, and returns (or not!) the lemma from the dictionary.
  • v4.pl is a Perl script called from verba.pl; it queries the Perseus under Philologic Latin parser and returns the lemma.

  • usp-bez.sh is a bash script which compares two lists of words; it reads a list in file1, compares the words with the list in file2, and writes the words not in file2 to file3. Syntax: ./usp-bez.sh foo.txt foo1.txt result.txt
  • usp.sh is a bash script which compares two lists of words: it reads a list in file1, compares the words with words in file2, and writes the words both in file1 and file2 to file3. Syntax: ./usp.sh foo.txt foo1.txt result.txt

Of course, the above is simpler with the comm utility…

 
jovanovic-digital-bibliography.1363192068.txt.gz · Last modified: 13. 03. 2013. 17:27 by njovanov
 
Recent changes RSS feed Creative Commons License Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki