Skip to content

Commit 850186b

Browse files
committed
Updates through week 4.
1 parent 9a42d74 commit 850186b

1 file changed

Lines changed: 29 additions & 29 deletions

File tree

classes/dsci550_2024a/index.html

Lines changed: 29 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -109,10 +109,11 @@ <h2 class="section-heading">DSCI 550: <font class="usc-color">Data Science at Sc
109109
<div class="row">
110110
<div class="col-md-16">
111111
<h2>Class Info</h2>
112-
Spring Semester, 2023<br/>
112+
Spring Semester, 2024<br/>
113113
<label>Location</label>
114114
: OHE 100D and online<br/>
115-
<label>Time</label>: Th 3:30-6:50pm<br/>
115+
<label>Time</label>
116+
: Th 4:00-7:20pm<br/>
116117
<label>Class number</label>
117118
: 32413D<br/>
118119
<label>Class number</label>
@@ -129,12 +130,11 @@ <h2>Instructor</h2>
129130
<label>Office Hours:</label>
130131
&nbsp;2:30pm-3:30pm <b>PHE 514</b>&nbsp;(right before class) </div>
131132
<div class="col-md-4">
132-
<h2>Teaching Assistant</h2>
133+
<h2>Course Producer</h2>
133134
<div class="student">
134135
<a href="www.linkedin.com/in/suchith-prathapaneni" target="blank">
135-
<img alt="Suchith Prathapaneni" src="../../images/headshots/suchith_photo.png"><br>
136-
<label>Suchith Prathapaneni </label></a><br><strong>E-Mail:</strong>&nbsp;<a href="mailto:sprathap@usc.edu">sprathap@usc.edu</a><br>
137-
<label>Office Hours:</label>&nbsp; Every Tuesday 2-3 PM &nbsp;
136+
<img alt="TBD" src="../../images/headshots/avatar-placeholder.svg"><br>
137+
<label>TBD </label></a><br><strong>E-Mail:</strong>&nbsp;<a href="mailto:tbd@usc.edu">tbd@usc.edu</a><br>
138138
</div>
139139

140140
</div>
@@ -292,21 +292,21 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
292292
</thead>
293293

294294
<tr>
295-
<td>1<br/><p class="text-emphasis-style" style="font-size:10px;">(Jan 12th, 2023)</p></td>
295+
<td>1<br/><p class="text-emphasis-style" style="font-size:10px;">(Jan 11th, 2024)</p></td>
296296
<td><ul class="text-left">
297297
<li>Course Introduction</li>
298298
<li>Introduction to Big Data</li>
299299
<li>DARPA XDATA Program - Overview Slides</li>
300300
<li>Breakout Groups on Big Data</li>
301301
</ul></td>
302302
<td><ul class="text-left"><li>Tika in Action, Chapter 1</li>
303-
<li>Mattmann, Chris. A vision for data science. Nature, Vol. 493, No. 7433, pp. 473-475, January 24, 2013. <strong>(Presented by: Alexander Mermelstein)</strong></li>
303+
<li>Mattmann, Chris. A vision for data science. Nature, Vol. 493, No. 7433, pp. 473-475, January 24, 2013. <strong>(Presented by: )</strong></li>
304304
<li>Lynch, Clifford. "Big data: How do your data grow?." Nature 455.7209 (2008): 28-29.</li>
305305
<li>Howe, Doug, et al. "Big data: The future of biocuration." Nature 455.7209 (2008): 47-50.</li>
306-
<li>Wigan, Marcus R., and Roger Clarke. "Big data's big unintended consequences." Computer 46.6 (2013): 46-53. <strong>(Presented by: Josephina Bian)</strong></li>
306+
<li>Wigan, Marcus R., and Roger Clarke. "Big data's big unintended consequences." Computer 46.6 (2013): 46-53. <strong>(Presented by: )</strong></li>
307307
<li>Schwartz, J. A. N. A., et al. "Measuring the value of Big Data exploitation systems: Quantitative, non-subjective metrics with the user as a key component." Parsons Journal for Information Mapping 6 (2014): 1-12.</li>
308-
<li>Sotera Defense Solutions. A Survey of Big Data Methods, Assessments, and Approaches. November 2012 <strong>(Presented by: Bess Djavadi)</strong></li>
309-
<li>De Mauro, Andrea, Marco Greco, and Michele Grimaldi. "What is big data? A consensual definition and a review of key research topics." AIP conference proceedings. Vol. 1644. No. 1. AIP, 2015. <strong>(Presented by: Bongjun Kim)</strong></li>
308+
<li>Sotera Defense Solutions. A Survey of Big Data Methods, Assessments, and Approaches. November 2012 <strong>(Presented by: )</strong></li>
309+
<li>De Mauro, Andrea, Marco Greco, and Michele Grimaldi. "What is big data? A consensual definition and a review of key research topics." AIP conference proceedings. Vol. 1644. No. 1. AIP, 2015. <strong>(Presented by: )</strong></li>
310310
</ul>
311311
</td>
312312
<td>Resources:<br/><br>
@@ -319,7 +319,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
319319
</tr>
320320

321321
<tr>
322-
<td>2<br/><p class="text-emphasis-style" style="font-size:10px;">(Jan 19th, 2023)</p></td>
322+
<td>2<br/><p class="text-emphasis-style" style="font-size:10px;">(Jan 18th, 2024)</p></td>
323323
<td><ul class="text-left">
324324
<li>Report out from Big Data Breakouts</li>
325325
<li>A Taxonomy of File Formats</li>
@@ -330,13 +330,13 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
330330
<td><ul class="text-left"><li>Tika in Action, Chapter 2</li>
331331
<li>Crocker, David. RFC 822 "Standard for the format of ARPA Internet text messages." (1982).</li>
332332
<li>Freed, Ned and Nathaniel Borenstein. RFC 1341. MIME (Multipurpose Internet Mail Extensions). Mechanisms for Specifying and Describing
333-
the Format of Internet Message Bodies. June 1992. <strong>(Presented by: <a href="https://youtu.be/nYWkoGhFDaw">Ruoming Li</a>)</strong></li>
334-
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2045. Multipurpose internet mail extensions (MIME) part one: Format of internet message bodies. 1996.<strong>(Presented by: <a href="https://youtu.be/awAe-Pu4KMQ">Tania Dawood</a>)</strong></li>
333+
the Format of Internet Message Bodies. June 1992. <strong>(Presented by: )</strong></li>
334+
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2045. Multipurpose internet mail extensions (MIME) part one: Format of internet message bodies. 1996.<strong>(Presented by: )</strong></li>
335335
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2046 Multipurpose internet mail extensions (MIME) part two: Media types, November, 1996.</li>
336-
<li>Freed, Ned. RFC 2048 "Multipurpose internet mail extensions (MIME) part four: Registration procedures." ISI (1996). <strong>(Presented by: <a href="https://youtu.be/Xo0n2dgpDPU">Jimin Ding</a>)</strong></li>
336+
<li>Freed, Ned. RFC 2048 "Multipurpose internet mail extensions (MIME) part four: Registration procedures." ISI (1996). <strong>(Presented by: </li>
337337
<li>Hicks, Ben J., et al. "Organizing and managing personal electronic files: A mechanical engineer's perspective." ACM Transactions on Information Systems (TOIS) 26.4 (2008): 23.</li>
338-
<li>Shim, Jungwon Roy. "Arium: Beyond the Desktop Metaphor: A new way of navigating, searching, and organizing personal digital data." Masters Thesus, Carnegie Mellon University (2012).<strong>(Presented by: <a href="https://youtu.be/1qsEHxS2cMM)">Kelly Choy</a>)</strong></li>
339-
<li>Crowder, Jerome, Jonathan Marion, and Michele Reilly. "File Naming in Digital Media Research: Examples from the Humanities and Social Sciences." Journal of Librarianship and Scholarly Communication 3.3 (2015). <strong>(Presented by: <a href="https://www.youtube.com/watch?v=zYuI2hkmymA">Annie Chang</a>)</strong></li>
338+
<li>Shim, Jungwon Roy. "Arium: Beyond the Desktop Metaphor: A new way of navigating, searching, and organizing personal digital data." Masters Thesus, Carnegie Mellon University (2012).<strong>(Presented by: )</strong></li>
339+
<li>Crowder, Jerome, Jonathan Marion, and Michele Reilly. "File Naming in Digital Media Research: Examples from the Humanities and Social Sciences." Journal of Librarianship and Scholarly Communication 3.3 (2015). <strong>(Presented by: )</strong></li>
340340
<li>Jackson, Andrew N. "Formats over time: Exploring UK web history." arXiv preprint arXiv:1210.1714 (2012). </li>
341341
</ul>
342342
</td>
@@ -345,19 +345,19 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
345345

346346
<tr>
347347
<td>3<br/>
348-
<p class="text-emphasis-style" style="font-size:10px;">(Jan 26th, 2023)</p></td>
348+
<p class="text-emphasis-style" style="font-size:10px;">(Jan 25th, 2024)</p></td>
349349
<td><ul class="text-left">
350350
<li>Report outs from the in class discussion around classifying files and the MIME taxonomy</li>
351351
<li>Document Similarity and Deduplication</li>
352352
<li>Individual Presentations - Week 2 Papers</li>
353353
</ul></td>
354354
<td><ul class="text-left"><li>Tika in Action, Chapter 3</li>
355355
<li>Bik, Elisabeth M., Casadevall, Arturo, Fang, Ferrie C. The Prevalence of Inappropriate Image Duplication in Biomedical Research Publications.</li>
356-
<li>Manku, Gurmeet Singh, Arvind Jain, and Anish Das Sarma. "Detecting near-duplicates for web crawling." Proceedings of the 16th international conference on World Wide Web. ACM, 2007. <strong>(Presented by: Pavle Medvidovic)</strong></li>
357-
<li>Henzinger, Monika. "Finding near-duplicate web pages: a large-scale evaluation of algorithms." Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2006. <strong>(Presented by: Angel Chavez-Penate)</strong> </li>
358-
<li>Cooper, Matthew, Jonathan Foote, and Andreas Girgensohn. "Automatically organizing digital photographs using time and content." Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. Vol. 3. IEEE, 2003. <strong>(Presented by: <a href="https://www.youtube.com/watch?v=FHtoj5EwMLw&t=1s">Kameswari Sridhara</a>)</strong></li>
359-
<li>Manber, Udi. "Finding similar files in a large file system." Usenix Winter. Vol. 94. 1994. <strong>(Presented by: Jingyi Wang)</strong></li>
360-
<li>Chim, Hung, and Xiaotie Deng. "Efficient phrase-based document similarity for clustering." IEEE Transactions on Knowledge and Data Engineering 20.9 (2008): 1217-1229. <strong>(Presented by: Andrew Bruneel)</strong></li>
356+
<li>Manku, Gurmeet Singh, Arvind Jain, and Anish Das Sarma. "Detecting near-duplicates for web crawling." Proceedings of the 16th international conference on World Wide Web. ACM, 2007. <strong>(Presented by: )</strong></li>
357+
<li>Henzinger, Monika. "Finding near-duplicate web pages: a large-scale evaluation of algorithms." Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2006. <strong>(Presented by: )</strong> </li>
358+
<li>Cooper, Matthew, Jonathan Foote, and Andreas Girgensohn. "Automatically organizing digital photographs using time and content." Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. Vol. 3. IEEE, 2003. <strong>(Presented by: )</strong></li>
359+
<li>Manber, Udi. "Finding similar files in a large file system." Usenix Winter. Vol. 94. 1994. <strong>(Presented by: )</strong></li>
360+
<li>Chim, Hung, and Xiaotie Deng. "Efficient phrase-based document similarity for clustering." IEEE Transactions on Knowledge and Data Engineering 20.9 (2008): 1217-1229. <strong>(Presented by: )</strong></li>
361361
</ul>
362362
</td>
363363
<td>Resources:&nbsp;<br/><br/><ul classs="text-left">
@@ -368,22 +368,22 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
368368

369369
<tr>
370370
<td>4<br/>
371-
<p class="text-emphasis-style" style="font-size:10px;">(Feb 2nd, 2023)</p></td>
371+
<p class="text-emphasis-style" style="font-size:10px;">(Feb 1st, 2024)</p></td>
372372
<td><ul class="text-left">
373373
<li>Document Type Detection</li>
374374
<li>Individual Presentations - Week 3 papers</li>
375375
<li>Advanced File System Statistics and Understanding</li>
376376
</ul></td>
377377
<td><ul class="text-left"><li>Tika in Action, Chapter 4</li>
378-
<li>Amirani, Mehdi Chehel, Mohsen Toorani, and A. Beheshti. A new approach to content-based file type detection. Computers and Communications, 2008. ISCC 2008. IEEE Symposium on. IEEE, 2008 <strong>(Presented by: Tala Tayebi)</strong> </li>
379-
<li>McDaniel, Mason, and M. Hossain Heydari. Content based file type detection algorithms. System Sciences, 2003. Proceedings of the 36th Annual Hawaii International Conference on. IEEE, 2003.<strong>(Presented by: <a href="https://www.youtube.com/watch?v=1HR9VaezzVE&t=105s">Derrick Hsu</a>)</strong></li>
380-
<li>Alamri, Nasser S., and William H. Allen. "A comparative study of file-type identification techniques." SoutheastCon 2015. IEEE, 2015.<strong>(Presented by: Ric Xian)</strong></li>
378+
<li>Amirani, Mehdi Chehel, Mohsen Toorani, and A. Beheshti. A new approach to content-based file type detection. Computers and Communications, 2008. ISCC 2008. IEEE Symposium on. IEEE, 2008 <strong>(Presented by: )</strong> </li>
379+
<li>McDaniel, Mason, and M. Hossain Heydari. Content based file type detection algorithms. System Sciences, 2003. Proceedings of the 36th Annual Hawaii International Conference on. IEEE, 2003.<strong>(Presented by: )</strong></li>
380+
<li>Alamri, Nasser S., and William H. Allen. "A comparative study of file-type identification techniques." SoutheastCon 2015. IEEE, 2015.<strong>(Presented by: )</strong></li>
381381
<li>Li, Wei-Jen, et al. "Fileprints: Identifying file types by n-gram analysis." Information Assurance Workshop, 2005. IAW'05. Proceedings from the Sixth Annual IEEE SMC. IEEE, 2005. </li>
382382
<li>Shahi, Ashim. "Classifying the classifiers for file fragment classification." Masters Thesis, Universiteit van Amsterdam (2012). </li>
383383
<li>Ahmed, Irfan, et al. "Fast file-type identification." Proceedings of the 2010 ACM Symposium on Applied Computing. ACM, 2010. </li>
384384
<li>Pierris, Georgios, and Stilianos Vidalis. "Forensically classifying files using HSOM algorithms." Emerging Intelligent Data and Web Technologies (EIDWT), 2012 Third International Conference on. IEEE, 2012.</li>
385-
<li>Harris, Ryan M. "Using artificial neural networks for forensic file type identification." Master's Thesis, Purdue University (2007). <strong>(Presented by: Xiaoyan Zhang)</strong></li>
386-
<li>Douceur, John R., and William J. Bolosky. A large-scale study of file-system contents. ACM SIGMETRICS Performance Evaluation Review 27.1 (1999): 59-70. <strong>(Presented by: <a href="https://www.youtube.com/watch?v=AAztusIz4AI&t=1s">Kristian Castellanos</a>)</strong></li>
385+
<li>Harris, Ryan M. "Using artificial neural networks for forensic file type identification." Master's Thesis, Purdue University (2007). <strong>(Presented by: )</strong></li>
386+
<li>Douceur, John R., and William J. Bolosky. A large-scale study of file-system contents. ACM SIGMETRICS Performance Evaluation Review 27.1 (1999): 59-70. <strong>(Presented by: )</strong></li>
387387
</ul>
388388
</td>
389389
<td>Resources:&nbsp;<br/><br/>

0 commit comments

Comments
 (0)