You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<td><ulclass="text-left"><li>Tika in Action, Chapter 1</li>
303
-
<li>Mattmann, Chris. A vision for data science. Nature, Vol. 493, No. 7433, pp. 473-475, January 24, 2013. <strong>(Presented by: Alexander Mermelstein)</strong></li>
303
+
<li>Mattmann, Chris. A vision for data science. Nature, Vol. 493, No. 7433, pp. 473-475, January 24, 2013. <strong>(Presented by: )</strong></li>
304
304
<li>Lynch, Clifford. "Big data: How do your data grow?." Nature 455.7209 (2008): 28-29.</li>
305
305
<li>Howe, Doug, et al. "Big data: The future of biocuration." Nature 455.7209 (2008): 47-50.</li>
306
-
<li>Wigan, Marcus R., and Roger Clarke. "Big data's big unintended consequences." Computer 46.6 (2013): 46-53. <strong>(Presented by: Josephina Bian)</strong></li>
306
+
<li>Wigan, Marcus R., and Roger Clarke. "Big data's big unintended consequences." Computer 46.6 (2013): 46-53. <strong>(Presented by: )</strong></li>
307
307
<li>Schwartz, J. A. N. A., et al. "Measuring the value of Big Data exploitation systems: Quantitative, non-subjective metrics with the user as a key component." Parsons Journal for Information Mapping 6 (2014): 1-12.</li>
308
-
<li>Sotera Defense Solutions. A Survey of Big Data Methods, Assessments, and Approaches. November 2012 <strong>(Presented by: Bess Djavadi)</strong></li>
309
-
<li>De Mauro, Andrea, Marco Greco, and Michele Grimaldi. "What is big data? A consensual definition and a review of key research topics." AIP conference proceedings. Vol. 1644. No. 1. AIP, 2015. <strong>(Presented by: Bongjun Kim)</strong></li>
308
+
<li>Sotera Defense Solutions. A Survey of Big Data Methods, Assessments, and Approaches. November 2012 <strong>(Presented by: )</strong></li>
309
+
<li>De Mauro, Andrea, Marco Greco, and Michele Grimaldi. "What is big data? A consensual definition and a review of key research topics." AIP conference proceedings. Vol. 1644. No. 1. AIP, 2015. <strong>(Presented by: )</strong></li>
310
310
</ul>
311
311
</td>
312
312
<td>Resources:<br/><br>
@@ -319,7 +319,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
@@ -330,13 +330,13 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
330
330
<td><ulclass="text-left"><li>Tika in Action, Chapter 2</li>
331
331
<li>Crocker, David. RFC 822 "Standard for the format of ARPA Internet text messages." (1982).</li>
332
332
<li>Freed, Ned and Nathaniel Borenstein. RFC 1341. MIME (Multipurpose Internet Mail Extensions). Mechanisms for Specifying and Describing
333
-
the Format of Internet Message Bodies. June 1992. <strong>(Presented by: <ahref="https://youtu.be/nYWkoGhFDaw">Ruoming Li</a>)</strong></li>
334
-
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2045. Multipurpose internet mail extensions (MIME) part one: Format of internet message bodies. 1996.<strong>(Presented by: <ahref="https://youtu.be/awAe-Pu4KMQ">Tania Dawood</a>)</strong></li>
333
+
the Format of Internet Message Bodies. June 1992. <strong>(Presented by: )</strong></li>
334
+
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2045. Multipurpose internet mail extensions (MIME) part one: Format of internet message bodies. 1996.<strong>(Presented by: )</strong></li>
335
335
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2046 Multipurpose internet mail extensions (MIME) part two: Media types, November, 1996.</li>
336
-
<li>Freed, Ned. RFC 2048 "Multipurpose internet mail extensions (MIME) part four: Registration procedures." ISI (1996). <strong>(Presented by: <ahref="https://youtu.be/Xo0n2dgpDPU">Jimin Ding</a>)</strong></li>
336
+
<li>Freed, Ned. RFC 2048 "Multipurpose internet mail extensions (MIME) part four: Registration procedures." ISI (1996). <strong>(Presented by: </li>
337
337
<li>Hicks, Ben J., et al. "Organizing and managing personal electronic files: A mechanical engineer's perspective." ACM Transactions on Information Systems (TOIS) 26.4 (2008): 23.</li>
338
-
<li>Shim, Jungwon Roy. "Arium: Beyond the Desktop Metaphor: A new way of navigating, searching, and organizing personal digital data." Masters Thesus, Carnegie Mellon University (2012).<strong>(Presented by: <ahref="https://youtu.be/1qsEHxS2cMM)">Kelly Choy</a>)</strong></li>
339
-
<li>Crowder, Jerome, Jonathan Marion, and Michele Reilly. "File Naming in Digital Media Research: Examples from the Humanities and Social Sciences." Journal of Librarianship and Scholarly Communication 3.3 (2015). <strong>(Presented by: <ahref="https://www.youtube.com/watch?v=zYuI2hkmymA">Annie Chang</a>)</strong></li>
338
+
<li>Shim, Jungwon Roy. "Arium: Beyond the Desktop Metaphor: A new way of navigating, searching, and organizing personal digital data." Masters Thesus, Carnegie Mellon University (2012).<strong>(Presented by: )</strong></li>
339
+
<li>Crowder, Jerome, Jonathan Marion, and Michele Reilly. "File Naming in Digital Media Research: Examples from the Humanities and Social Sciences." Journal of Librarianship and Scholarly Communication 3.3 (2015). <strong>(Presented by: )</strong></li>
340
340
<li>Jackson, Andrew N. "Formats over time: Exploring UK web history." arXiv preprint arXiv:1210.1714 (2012). </li>
341
341
</ul>
342
342
</td>
@@ -345,19 +345,19 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
<li>Report outs from the in class discussion around classifying files and the MIME taxonomy</li>
351
351
<li>Document Similarity and Deduplication</li>
352
352
<li>Individual Presentations - Week 2 Papers</li>
353
353
</ul></td>
354
354
<td><ulclass="text-left"><li>Tika in Action, Chapter 3</li>
355
355
<li>Bik, Elisabeth M., Casadevall, Arturo, Fang, Ferrie C. The Prevalence of Inappropriate Image Duplication in Biomedical Research Publications.</li>
356
-
<li>Manku, Gurmeet Singh, Arvind Jain, and Anish Das Sarma. "Detecting near-duplicates for web crawling." Proceedings of the 16th international conference on World Wide Web. ACM, 2007. <strong>(Presented by: Pavle Medvidovic)</strong></li>
357
-
<li>Henzinger, Monika. "Finding near-duplicate web pages: a large-scale evaluation of algorithms." Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2006. <strong>(Presented by: Angel Chavez-Penate)</strong></li>
358
-
<li>Cooper, Matthew, Jonathan Foote, and Andreas Girgensohn. "Automatically organizing digital photographs using time and content." Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. Vol. 3. IEEE, 2003. <strong>(Presented by: <ahref="https://www.youtube.com/watch?v=FHtoj5EwMLw&t=1s">Kameswari Sridhara</a>)</strong></li>
359
-
<li>Manber, Udi. "Finding similar files in a large file system." Usenix Winter. Vol. 94. 1994. <strong>(Presented by: Jingyi Wang)</strong></li>
360
-
<li>Chim, Hung, and Xiaotie Deng. "Efficient phrase-based document similarity for clustering." IEEE Transactions on Knowledge and Data Engineering 20.9 (2008): 1217-1229. <strong>(Presented by: Andrew Bruneel)</strong></li>
356
+
<li>Manku, Gurmeet Singh, Arvind Jain, and Anish Das Sarma. "Detecting near-duplicates for web crawling." Proceedings of the 16th international conference on World Wide Web. ACM, 2007. <strong>(Presented by: )</strong></li>
357
+
<li>Henzinger, Monika. "Finding near-duplicate web pages: a large-scale evaluation of algorithms." Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2006. <strong>(Presented by: )</strong></li>
358
+
<li>Cooper, Matthew, Jonathan Foote, and Andreas Girgensohn. "Automatically organizing digital photographs using time and content." Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. Vol. 3. IEEE, 2003. <strong>(Presented by: )</strong></li>
359
+
<li>Manber, Udi. "Finding similar files in a large file system." Usenix Winter. Vol. 94. 1994. <strong>(Presented by: )</strong></li>
360
+
<li>Chim, Hung, and Xiaotie Deng. "Efficient phrase-based document similarity for clustering." IEEE Transactions on Knowledge and Data Engineering 20.9 (2008): 1217-1229. <strong>(Presented by: )</strong></li>
<li>Advanced File System Statistics and Understanding</li>
376
376
</ul></td>
377
377
<td><ulclass="text-left"><li>Tika in Action, Chapter 4</li>
378
-
<li>Amirani, Mehdi Chehel, Mohsen Toorani, and A. Beheshti. A new approach to content-based file type detection. Computers and Communications, 2008. ISCC 2008. IEEE Symposium on. IEEE, 2008 <strong>(Presented by: Tala Tayebi)</strong></li>
379
-
<li>McDaniel, Mason, and M. Hossain Heydari. Content based file type detection algorithms. System Sciences, 2003. Proceedings of the 36th Annual Hawaii International Conference on. IEEE, 2003.<strong>(Presented by: <ahref="https://www.youtube.com/watch?v=1HR9VaezzVE&t=105s">Derrick Hsu</a>)</strong></li>
380
-
<li>Alamri, Nasser S., and William H. Allen. "A comparative study of file-type identification techniques." SoutheastCon 2015. IEEE, 2015.<strong>(Presented by: Ric Xian)</strong></li>
378
+
<li>Amirani, Mehdi Chehel, Mohsen Toorani, and A. Beheshti. A new approach to content-based file type detection. Computers and Communications, 2008. ISCC 2008. IEEE Symposium on. IEEE, 2008 <strong>(Presented by: )</strong></li>
379
+
<li>McDaniel, Mason, and M. Hossain Heydari. Content based file type detection algorithms. System Sciences, 2003. Proceedings of the 36th Annual Hawaii International Conference on. IEEE, 2003.<strong>(Presented by: )</strong></li>
380
+
<li>Alamri, Nasser S., and William H. Allen. "A comparative study of file-type identification techniques." SoutheastCon 2015. IEEE, 2015.<strong>(Presented by: )</strong></li>
381
381
<li>Li, Wei-Jen, et al. "Fileprints: Identifying file types by n-gram analysis." Information Assurance Workshop, 2005. IAW'05. Proceedings from the Sixth Annual IEEE SMC. IEEE, 2005. </li>
382
382
<li>Shahi, Ashim. "Classifying the classifiers for file fragment classification." Masters Thesis, Universiteit van Amsterdam (2012). </li>
383
383
<li>Ahmed, Irfan, et al. "Fast file-type identification." Proceedings of the 2010 ACM Symposium on Applied Computing. ACM, 2010. </li>
384
384
<li>Pierris, Georgios, and Stilianos Vidalis. "Forensically classifying files using HSOM algorithms." Emerging Intelligent Data and Web Technologies (EIDWT), 2012 Third International Conference on. IEEE, 2012.</li>
385
-
<li>Harris, Ryan M. "Using artificial neural networks for forensic file type identification." Master's Thesis, Purdue University (2007). <strong>(Presented by: Xiaoyan Zhang)</strong></li>
386
-
<li>Douceur, John R., and William J. Bolosky. A large-scale study of file-system contents. ACM SIGMETRICS Performance Evaluation Review 27.1 (1999): 59-70. <strong>(Presented by: <ahref="https://www.youtube.com/watch?v=AAztusIz4AI&t=1s">Kristian Castellanos</a>)</strong></li>
385
+
<li>Harris, Ryan M. "Using artificial neural networks for forensic file type identification." Master's Thesis, Purdue University (2007). <strong>(Presented by: )</strong></li>
386
+
<li>Douceur, John R., and William J. Bolosky. A large-scale study of file-system contents. ACM SIGMETRICS Performance Evaluation Review 27.1 (1999): 59-70. <strong>(Presented by: )</strong></li>
0 commit comments