Skip to content

Commit 1c6aaa3

Browse files
committed
Updates through week 16.
1 parent 2387c22 commit 1c6aaa3

1 file changed

Lines changed: 27 additions & 27 deletions

File tree

classes/dsci550_2024a/index.html

Lines changed: 27 additions & 27 deletions
Original file line numberDiff line numberDiff line change
@@ -527,19 +527,19 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
527527
</tr>
528528
<tr>
529529
<td>11<br/>
530-
<p class="text-emphasis-style" style="font-size:10px;">(March 23rd, 2023)</p></td>
530+
<p class="text-emphasis-style" style="font-size:10px;">(March 21st, 2024)</p></td>
531531
<td><ul class="text-left">
532532
<li>Individual Presentations</li>
533533
<li>Named Entity Recognition</li>
534534
<li>Hadoop Spark and Tika: Large Scale Content Detection and Analysis</li>
535535
</ul></td>
536536
<td><ul class="text-left"><li>Tika in Action, Chapter 9</li>
537-
<li>Dean, Jeffrey, and Sanjay Ghemawat. MapReduce: simplified data processing on large clusters. Communications of the ACM 51.1 (2008): 107-113.<strong>(Presented by: Anqi Wu)</strong></li>
538-
<li>Zaharia, Matei, et al. Spark: cluster computing with working sets.Proceedings of the 2nd USENIX conference on Hot topics in cloud computing. Vol. 10. 2010.<strong>(Presented by: Wonseuk Her)</strong></li>
539-
<li>Elsayed, Tamer, Jimmy Lin, and Douglas W. Oard. "Pairwise document similarity in large collections with MapReduce." Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers. Association for Computational Linguistics, 2008. <strong>(Presented by: Haorui Ni)</strong> </li>
540-
<li>M. Bernaschi, M. Cianfriglia, A. Di Marco, A. Sabellico, G. Me, G. Carbone, G. Totaro. Forensic Disk Image Indexing and Search in an HPC environment. IEEE International Conference on High Performance Computing &amp; Simulation (HPCS), 2014.<strong>(Presented by: Arya Sun)</strong></li>
537+
<li>Dean, Jeffrey, and Sanjay Ghemawat. MapReduce: simplified data processing on large clusters. Communications of the ACM 51.1 (2008): 107-113.<strong>(Presented by: )</strong></li>
538+
<li>Zaharia, Matei, et al. Spark: cluster computing with working sets.Proceedings of the 2nd USENIX conference on Hot topics in cloud computing. Vol. 10. 2010.<strong>(Presented by: )</strong></li>
539+
<li>Elsayed, Tamer, Jimmy Lin, and Douglas W. Oard. "Pairwise document similarity in large collections with MapReduce." Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers. Association for Computational Linguistics, 2008. <strong>(Presented by: )</strong> </li>
540+
<li>M. Bernaschi, M. Cianfriglia, A. Di Marco, A. Sabellico, G. Me, G. Carbone, G. Totaro. Forensic Disk Image Indexing and Search in an HPC environment. IEEE International Conference on High Performance Computing &amp; Simulation (HPCS), 2014.<strong>(Presented by: )</strong></li>
541541
<li>Meusel, Robert, Peter Mika, and Roi Blanco. "Focused crawling for structured data." Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. ACM, 2014.</li>
542-
<li>Niu, Feng, et al. "DeepDive: Web-scale Knowledge-base Construction using Statistical Learning and Inference." VLDS 12 (2012): 25-28.<strong>(Presented by: Yenlin Lee)</strong></li>
542+
<li>Niu, Feng, et al. "DeepDive: Web-scale Knowledge-base Construction using Statistical Learning and Inference." VLDS 12 (2012): 25-28.<strong>(Presented by: )</strong></li>
543543
<li>Mattmann, C. A., Oh, J. H., Palsulich, T., McGibbney, L. J., Gil, Y., &amp; Ratnakar, V. (2015, November). DRAT: An Unobtrusive, Scalable Approach to Large Scale Software License Analysis. In Automated Software Engineering Workshop (ASEW), 2015 30th IEEE/ACM International Conference on (pp. 97-101). IEEE. </li>
544544
</ul>
545545
</td>
@@ -553,7 +553,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
553553

554554
<tr>
555555
<td>12<br/>
556-
<p class="text-emphasis-style" style="font-size:10px;">(March 30th, 2023)</p></td>
556+
<p class="text-emphasis-style" style="font-size:10px;">(March 28th, 2024)</p></td>
557557
<td><ul class="text-left">
558558
<li>Assignment 3 - Introduction</li>
559559
<li>Readout - Named Entity Recognition Group Presentations</li>
@@ -564,12 +564,12 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
564564
<td><ul class="text-left">
565565
<li>Tika in Action, Chapter 10</li>
566566
<li>Białecki, Andrzej, et al. "Apache lucene 4." SIGIR 2012 workshop on open source information retrieval. 2012. </li>
567-
<li>Turtle, Howard, Yatish Hegde, and S. Rowe. "Yet another comparison of lucene and indri performance." SIGIR 2012 Workshop on Open Source Information Retrieval. 2012.<strong>(Presented by: Kiani Sheppard)</strong></li>
568-
<li>Bontcheva, Kalina, et al. "TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text." RANLP. 2013.<strong>(Presented by: Yi Chang)</strong></li>
567+
<li>Turtle, Howard, Yatish Hegde, and S. Rowe. "Yet another comparison of lucene and indri performance." SIGIR 2012 Workshop on Open Source Information Retrieval. 2012.<strong>(Presented by: )</strong></li>
568+
<li>Bontcheva, Kalina, et al. "TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text." RANLP. 2013.<strong>(Presented by: )</strong></li>
569569
<li>Cunningham, Hamish. "GATE, a general architecture for text engineering." Computers and the Humanities 36.2 (2002): 223-254.</li>
570570
<li>Atserias, Jordi, et al. "FreeLing 1.3: Syntactic and semantic services in an open-source NLP library." Proceedings of LREC. Vol. 6. 2006. </li>
571-
<li>Manning, Christopher D., et al. "The stanford corenlp natural language processing toolkit." ACL (System Demonstrations). 2014.<strong>(Presented by: Mingyu Zong)</strong></li>
572-
<li>Savova, Guergana K., et al. "Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications." Journal of the American Medical Informatics Association 17.5 (2010): 507-513.&nbsp;<strong>(Presented by: Kelly Fong)</strong></li>
571+
<li>Manning, Christopher D., et al. "The stanford corenlp natural language processing toolkit." ACL (System Demonstrations). 2014.<strong>(Presented by: )</strong></li>
572+
<li>Savova, Guergana K., et al. "Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications." Journal of the American Medical Informatics Association 17.5 (2010): 507-513.&nbsp;<strong>(Presented by: )</strong></li>
573573
</ul>
574574
</td>
575575
<td>Resources:<br/>
@@ -583,18 +583,18 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
583583

584584
<tr>
585585
<td>13<br/>
586-
<p class="text-emphasis-style" style="font-size:10px;">(April 6th, 2023)</p></td>
586+
<p class="text-emphasis-style" style="font-size:10px;">(April 4th, 2024)</p></td>
587587
<td><ul class="text-left">
588588
<li>Ted Talk (click link in resources)</li>
589589
<li>Evaluating Content Detection</li>
590590
<li>Walkthrough of Polar Deep Insights</li>
591591
<li>Individual Presentations </li>
592592
</ul></td>
593593
<td><ul class="text-left"><li>Tika in Action, Chapter 11</li>
594-
<li>Nowell, Lucy Terry, et al. "Visualizing search results: some alternatives to query-document similarity." Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 1996. <strong>&nbsp;(Presented by: Christy Xie)</strong></li>
595-
<li>Shneiderman, Ben. "The eyes have it: A task by data type taxonomy for information visualizations." Visual Languages, 1996. Proceedings., IEEE Symposium on. IEEE, 1996. <strong>(Presented by: Shreya Raj)</strong></li>
596-
<li>Gottron, Thomas. "Evaluating content extraction on HTML documents." Proceedings of the 2nd International Conference on Internet Technologies and Applications (ITA’07). 2007. <strong>(Presented by: Kaixin Guo)</strong></li>
597-
<li>Leuski, Anton. "Evaluating document clustering for interactive information retrieval." Proceedings of the tenth international conference on Information and knowledge management. ACM, 2001. <strong>(Presented by: Jai Agrawal)</strong></li>
594+
<li>Nowell, Lucy Terry, et al. "Visualizing search results: some alternatives to query-document similarity." Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 1996. <strong>&nbsp;(Presented by: )</strong></li>
595+
<li>Shneiderman, Ben. "The eyes have it: A task by data type taxonomy for information visualizations." Visual Languages, 1996. Proceedings., IEEE Symposium on. IEEE, 1996. <strong>(Presented by: )</strong></li>
596+
<li>Gottron, Thomas. "Evaluating content extraction on HTML documents." Proceedings of the 2nd International Conference on Internet Technologies and Applications (ITA’07). 2007. <strong>(Presented by: )</strong></li>
597+
<li>Leuski, Anton. "Evaluating document clustering for interactive information retrieval." Proceedings of the tenth international conference on Information and knowledge management. ACM, 2001. <strong>(Presented by: )</strong></li>
598598
<li>Bailey, Peter, et al. "Evaluating search systems using result page context." Proceedings of the third symposium on Information interaction in context. ACM, 2010.</li>
599599
</ul>
600600
</td>
@@ -606,7 +606,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
606606

607607
<tr>
608608
<td>14<br/>
609-
<p class="text-emphasis-style" style="font-size:10px;">(April 13th, 2023)</p></td>
609+
<p class="text-emphasis-style" style="font-size:10px;">(April 11th, 2024)</p></td>
610610
<td><ul class="text-left">
611611
<li>Group Readouts on Evaluating Content Detection and Analysis</li>
612612
<li>Lecture on NoSQL</li>
@@ -616,11 +616,11 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
616616
</td>
617617
<td><ul class="text-left">
618618
<li>Palamuttam, Rahul, et al. "SciSpark: Applying in-memory distributed computing to weather event detection and tracking." Big Data (Big Data), 2015 IEEE International Conference on. IEEE, 2015.</li>
619-
<li>Leavitt, Neal. "Will NoSQL databases live up to their promise?." Computer 43.2 (2010). <strong>(Presented by: Adelaide Han)</strong></li>
620-
<li>Stonebraker, Michael. "SQL databases v. NoSQL databases." Communications of the ACM 53.4 (2010): 10-11.<strong>(Presented by: Tongxin Ye)</strong></li>
619+
<li>Leavitt, Neal. "Will NoSQL databases live up to their promise?." Computer 43.2 (2010). <strong>(Presented by: )</strong></li>
620+
<li>Stonebraker, Michael. "SQL databases v. NoSQL databases." Communications of the ACM 53.4 (2010): 10-11.<strong>(Presented by: )</strong></li>
621621
<li>Stonebraker, Michael. "Stonebraker on NoSQL and enterprises." Communications of the ACM 54.8 (2011): 10-11. </li>
622-
<li>Rafique, Ansar, et al. "On the performance impact of data access middleware for nosql data stores." IEEE Transactions on Cloud Computing (2015). <strong>(Presented by: Ara Nazari)</strong></li>
623-
<li>Moniruzzaman, A. B. M., and Syed Akhter Hossain. "Nosql database: New era of databases for big data analytics-classification, characteristics and comparison." arXiv preprint arXiv:1307.0191 (2013). <strong>(Presented by: Xin Sun)</strong> </li>
622+
<li>Rafique, Ansar, et al. "On the performance impact of data access middleware for nosql data stores." IEEE Transactions on Cloud Computing (2015). <strong>(Presented by: )</strong></li>
623+
<li>Moniruzzaman, A. B. M., and Syed Akhter Hossain. "Nosql database: New era of databases for big data analytics-classification, characteristics and comparison." arXiv preprint arXiv:1307.0191 (2013). <strong>(Presented by: )</strong> </li>
624624
</ul>
625625
</td>
626626
<td>Resources:<br/>
@@ -633,7 +633,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
633633
</tr>
634634
<tr>
635635
<td>15<br/>
636-
<p class="text-emphasis-style" style="font-size:10px;">(April 20th, 2023)</p></td>
636+
<p class="text-emphasis-style" style="font-size:10px;">(April 18th, 2024)</p></td>
637637
<td><ul class="text-left">
638638
<li>Video - Scientific Data: Water and Snow in the Western US</li>
639639
<li>Searching Scientific Datasets</li>
@@ -645,11 +645,11 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
645645
<li>C. Mattmann, D. Freeborn, D. Crichton, B. Foster, A. Hart, D. Woollard, S. Hardman, P. Ramirez, S. Kelly, A. Y. Chang, C. E. Miller. A Reusable Process Control System Framework for the Orbiting Carbon Observatory and NPP Sounder PEATE missions. In Proceedings of the 3rd IEEE Intl Conference on Space Mission Challenges for Information Technology (SMC-IT 2009), pp. 165-172, July 19 - 23, 2009.
646646
</li>
647647
<li>Wilkinson, Mark D., et al. "The FAIR Guiding Principles for scientific data management and stewardship." Scientific data 3 (2016): 160018. </li>
648-
<li>Buneman, Peter, et al. "Archiving scientific data." ACM Transactions on Database Systems (TODS) 29.1 (2004): 2-42. <strong>(Presented by: Elizama Penaloza)</strong> </li>
649-
<li>Fox, Peter, and James Hendler. "Changing the equation on scientific data visualization." Science 331.6018 (2011): 705-708.<strong>(Presented by: Richard Gallardo)</strong></li>
648+
<li>Buneman, Peter, et al. "Archiving scientific data." ACM Transactions on Database Systems (TODS) 29.1 (2004): 2-42. <strong>(Presented by: )</strong> </li>
649+
<li>Fox, Peter, and James Hendler. "Changing the equation on scientific data visualization." Science 331.6018 (2011): 705-708.<strong>(Presented by: )</strong></li>
650650
<li>Plale, Beth, et al. "Active management of scientific data." IEEE Internet Computing 9.1 (2005): 27-34.</li>
651-
<li>Gray, Jim, et al. "Scientific data management in the coming decade." ACM SIGMOD Record 34.4 (2005): 34-41. <strong>(Presented by: Julia Huang)</strong></li>
652-
<li>Ailamaki, Anastasia, Verena Kantere, and Debabrata Dash. "Managing scientific data." Communications of the ACM 53.6 (2010): 68-78. <strong>(Presented by: Dhandeep Suglani)</strong></li>
651+
<li>Gray, Jim, et al. "Scientific data management in the coming decade." ACM SIGMOD Record 34.4 (2005): 34-41. <strong>(Presented by: )</strong></li>
652+
<li>Ailamaki, Anastasia, Verena Kantere, and Debabrata Dash. "Managing scientific data." Communications of the ACM 53.6 (2010): 68-78. <strong>(Presented by: )</strong></li>
653653
</ul>
654654
</td>
655655
<td>Resources:&nbsp;<br/>
@@ -660,7 +660,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
660660

661661
<tr>
662662
<td>16<br/>
663-
<p class="text-emphasis-style" style="font-size:10px;">(April 28th, 2023)</p></td>
663+
<p class="text-emphasis-style" style="font-size:10px;">(April 25th, 2024)</p></td>
664664
<td><ul class="text-left">
665665
<li>Big Data with an Eye Towards the Future: Discussion</li>
666666
</ul></td>

0 commit comments

Comments
 (0)