==== Film corpus ====

Last modified by: Grace I. Lin (glin@soe.ucsc.edu)

Unix commands that I have to look up EVERY TIME:

Compress:
    tar -cvzf file.tar.gz inputfile1 inputfile2
Uncompress:
    tar -xvzf file.tar.gz
Count number of files in the directory:
    ls -1 | wc -l

What's here:
* film_20100519/ - an older version of the film corpus from IMSDB (2010)
* film_20100519.tar.gz
* film_2012xxxx/ - a newer version of the film corpus from IMSDB; currently in progress
* film-dialogue-lrec2012-v6.pdf - LREC 2012 paper
* lrec2012-vertical-2.pdf - LREC 2012 poster