Amati, G., Amodeo, G., Bianchi, M., Celi, A., De Nicola, C., Flammini, M., Gaibisso, C., Gambosi, G., andMarcone, G. (2011). “Fub, iasi-cnr, univaq at trec 2011”. In:Text REtrieval Conference (TREC 2011). US.Antenucci, D., Handy, G., Modi, A., and Tinkerhess, M. (2011). “Classification of tweets via clustering of hashtags”.In:EECS545, pp. 1–11.Antonellis, G., Gavras, A. G., Panagiotou, M., Kutter, B. L., Guerrini, G., Sander, A. C., and Fox, P. J. (May 2015).“Shake Table Test of Large-Scale Bridge Columns Supported on Rocking Shallow Foundations”. In:Journal ofGeotechnical and Geoenvironmental Engineering141.5, p. 04015009.Barandela, R., Valdovinos, R. M., Sánchez, J. S., and Ferri, F. J. (2004). “The imbalanced training sample problem:Under or over sampling?” In:Joint IAPR international workshops on statistical techniques in pattern recognition(SPR) and structural and syntactic pattern recognition (SSPR). Springer, pp. 806–814.Buntain, C. L. and Sharma, S. (2020). “#pray4victims : Improving Classification of Crisis-Related Social MediaContent via Text Augmentation and Image Analysis”. In:The Text REtrieval Conference (TREC) 2020. JerseyCity, NY, USA: ACM.Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P. (June 2002). “SMOTE: Synthetic MinorityOver-sampling Technique”. In:Journal of Artificial Intelligence Research16, pp. 321–357.Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2018). “Bert: Pre-training of deep bidirectional transformersfor language understanding”. In:arXiv preprint arXiv:1810.04805.Drucker, H., Burges, C. J., Kaufman, L., Smola, A., Vapnik, V., et al. (1997). “Support vector regression machines”.In:Advances in neural information processing systems9, pp. 155–161.Drummond, C., Holte, R. C., et al. (2003). “C4. 5, class imbalance, and cost sensitivity: why under-sampling beatsover-sampling”. In:Workshop on learning from imbalanced datasets II. Vol. 11. Citeseer, pp. 1–8.Felzenszwalb, P. F., Girshick, R. B., McAllester, D., and Ramanan, D. (Sept. 2010). “Object Detection withDiscriminatively Trained Part-Based Models”. In:IEEE Transactions on Pattern Analysis and Machine Intelligence32.9, pp. 1627–1645.Feurer, M. and Hutter, F. (2019). “Hyperparameter optimization”. In:Automated Machine Learning. Springer,Cham, pp. 3–33.Fung, I. C.-H., Yin, J., Pressley, K. D., Duke, C. H., Mo, C., Liang, H., Fu, K.-W., Tse, Z. T. H., and Hou, S.-I.(2019). “Pedagogical Demonstration of Twitter Data Analysis: A Case Study of World AIDS Day, 2014”. In:Data4.2, p. 84.He, H. and Garcia, E. A. (2009). “Learning from imbalanced data”. In:IEEE Transactions on knowledge and dataengineering21.9, pp. 1263–1284.Huang, C., Li, Y., Loy, C. C., and Tang, X. (June 2016). “Learning Deep Representation for Imbalanced Classification”.In:2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Japkowicz, N. and Stephen, S. (2002). “The class imbalance problem: A systematic study”. In:Intelligent dataanalysis6.5, pp. 429–449.Jeatrakul, P., Wong, K. W., and Fung, C. C. (2010). “Classification of Imbalanced Data by Combining theComplementary Neural Network and SMOTE Algorithm”. In:Neural Information Processing. Models andApplications, pp. 152–159.Kim, A., Miano, T., Chew, R., Eggers, M., and Nonnemaker, J. (2017). “Classification of Twitter users who tweetabout e-cigarettes”. In:JMIR public health and surveillance3.3, e63.Kim, E. H.-J., Jeong, Y. K., Kim, Y., Kang, K. Y., and Song, M. (2016). “Topic-based content and sentiment analysisof Ebola virus on Twitter and in the news”. In:Journal of Information Science42.6, pp. 763–781.Krawczyk, B. (Apr. 2016). “Learning from imbalanced data: open challenges and future directions”. In:Progress inArtificial Intelligence5.4, pp. 221–232.Maciejewski, T. and Stefanowski, J. (Apr. 2011). “Local neighbourhood extension of SMOTE for mining imbalanceddata”. In:2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).McCreadie, R., Buntain, C., and Soboroff, I. (2019). “TREC Incident Streams: Finding Actionable Information onSocial Media”. In:Proceedings of the 16th International Conference on Information Systems for Crisis Responseand Management (ISCRAM).McCreadie, R., Buntain, C., and Soboroff, I. (2020). “Incident Streams 2019: Actionable Insights and How to FindThem”. In:Müller, M., Salathé, M., and Kummervold, P. E. (2020). “COVID-Twitter-BERT: A Natural Language ProcessingModel to Analyse COVID-19 Content on Twitter”. In:arXiv preprint arXiv:2005.07503.Nagar, R., Yuan, Q., Freifeld, C. C., Santillana, M., Nojima, A., Chunara, R., and Brownstein, J. S. (2014). “A casestudy of the New York City 2012-2013 influenza season with daily geocoded Twitter data from temporal andspatiotemporal perspectives”. In:Journal of medical Internet research16.10, e236.Quinlan, J. R. (1986). “Induction of decision trees”. In:Machine learning1.1, pp. 81–106.Razavian, A. S., Azizpour, H., Sullivan, J., and Carlsson, S. (June 2014). “CNN Features Off-the-Shelf: AnAstounding Baseline for Recognition”. In:2014 IEEE Conference on Computer Vision and Pattern RecognitionWorkshops.Shrivastava, A., Gupta, A., and Girshick, R. (June 2016). “Training Region-Based Object Detectors with OnlineHard Example Mining”. In:2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Simonyan, K. and Zisserman, A. (2014).Very Deep Convolutional Networks for Large-Scale Image Recognition.arXiv:1409.1556 [cs.CV].Song, H. O., Xiang, Y., Jegelka, S., and Savarese, S. (June 2016). “Deep Metric Learning via Lifted StructuredFeature Embedding”. In:2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Wang, C. and Lillis, D. (2020). “#pray4victims : Multi-task transfer learning for finding actionable information fromcrisis-related messages on social media”. In:The Text REtrieval Conference (TREC) 2020. Dublin, Ireland: ACM.Wang, X. and Gupta, A. (Dec. 2015). “Unsupervised Learning of Visual Representations Using Videos”. In:2015IEEE International Conference on Computer Vision (ICCV).Weiss, G. M. (2004). “Mining with rarity: a unifying framework”. In:ACM Sigkdd Explorations Newsletter6.1,pp. 7–19.Widener, M. J. and Li, W. (2014). “Using geolocated Twitter data to monitor the prevalence of healthy and unhealthyfood references across the US”. In:Applied Geography54, pp. 189–197.Wright, R. E. (1995). “Logistic regression.” In:Xia, P., Wu, S., and Van Durme, B. (2020). “Which* bert? a survey organizing contextualized encoders”. In:arXivpreprint arXiv:2010.00854.Ye, X., Li, S., Yang, X., and Qin, C. (2016). “Use of social media for the detection and analysis of infectious diseasesin China”. In:ISPRS International Journal of Geo-Information5.9, p. 156.Zhang, Y., Jin, R., and Zhou, Z.-H. (2010). “Understanding bag-of-words model: a statistical framework”. In:International Journal of Machine Learning and Cybernetics1.1-4, pp. 43–52.