Breiman L (1996) Bagging predictors. Mach Learn 24:123–140

Google Scholar

Clevert D, Unterthiner T, Hochreiter S (2015) Fast and accurate deep network learning by exponential linear units (ELUs). Paper presented at international conference on learning representation (ICLR) 2016, Caribe Hilton, San Juan, 2–4 May 2016

DeVries T, Taylor GW (2017) Improved regularization of convolutional neural networks with cutout. arXiv:1708.04552

Frank E, Hal M (2001) A simple approach to ordinal classification. Paper presented at the 12th European Conference on Machine Learning ECML) 2001, Freiburg, 5–7 September 2001

Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139

Article
Google Scholar

Gastaldi X (2017) Shake-shake regularization of 3-branch residual networks. Paper presented at international conference on learning representation (ICLR) 2017, Palais des Congrès Neptune, Toulon, 24–26 April 2017

Goodfellow I, Bengio Y, Courville A (2016) Deep learning. The MIT Press, Cambridge

Google Scholar

Han D, Kim J, Kim J (2017) Deep pyramidal residual networks. Paper presented at conference on computer vision and pattern recognition (CVPR) 2017, Hawaii Convention Center, Honolulu, 21–26 July 2017

He H, Garcia EA (2009) Learning from imbalanced data. IEEE Trans Knowl Data Eng 21(9):1263–1284

He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. Paper presented at international conference on computer vision (ICCV) 2015, CentroParque Convention Center, Santiago, 13–16 December 2015

He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. Paper presented at conference on computer vision and pattern recognition (CVPR) 2016, Caesar's Palace, Las Vegas, 26 June–1 July 2016

Hinton GE, Dayan P, Frey BJ, Neal RM (1995) The “wake-sleep” algorithm for unsupervised neural networks. Science 268(5214):1158–1161

Article
Google Scholar

Hochreiter S (1998) The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int J Uncertain Fuzziness Knowl Based Syst 6(2):107–116

Article
Google Scholar

Hu Y, Huber A, Anumula J, Liu SC (2018) Overcoming the vanishing gradient problem in plain recurrent networks. arXiv:1801.06105

Kabari LG, Onwuka U (2019) Comparison of bagging and voting ensemble machine learning algorithm as a classifier. Int J Comput Sci Softw Eng 9(3):19–23

Kim SK, Ames S, Lee J, Zhang C, Wilson AC, Williams D (2017) In: Ebert-Uphoff I, Monteleoni C, Nychka D (eds) Massive scale deep learning for detecting extreme climate events. Proceedings of the 7th international workshop on climate informatics, Boulder 2017

Kim SK, Park S, Chung S, Lee J, Lee Y, Kim H, Prabhat, Choo J (2019) Learning to focus and track extreme climate events. Paper presented at the 30th British machine vision conference (BMVC) 2019, Cardiff University, Cardiff, 9–12 September 2019

Kodama C, Yamada Y, Noda AT, Kikuchi K, Kajikawa Y, Nasuno T, Tomita T, Yamaura T, Takahashi HG, Hara M, Kawatani Y, Satoh M, Sugi M (2015) A 20-year climatology of a NICAM AMIP-type simulation. J Meteorol Soc Jpn 93(4):393–424

Article
Google Scholar

Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 60:84–90

Google Scholar

LeCun Y, Bengio Y (1995) Convolutional networks for images, speech, and time-series. In: Arbib MA (ed) The hand book of brain theory and neural networks. MIT Press, Cambridge

Google Scholar

Leevy JL, Khoshgoftaar TM, Bauder RA, Seliya N (2018) A survey on addressing high-class imbalance in big data. J Big Data 5(1):42. https://doi.org/10.1186/s40537-018-0151-6

Article
Google Scholar

Leon F, Floria S-A, Bădică C (2017) Evaluating the effect of voting methods on ensemble-based classification. Paper presented at international conference on innovations in intelligent systems and applications (INSTA) 2017, Nadmorski Hotel, Gdynia, 3–5 July 2017

Li Y, Song Y, Luo J (2017) Improving pairwise ranking for multi-label image classification. Paper presented at conference on computer vision and pattern recognition (CVPR) 2017, Hawaii Convention Center, Honolulu, 21–26 July 2017

Lin TY, Goyal P, Girshick R, He K, Dollár P (2020) Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell 42(2):318–327

Article
Google Scholar

Liu Y, Racah E, Prabhat, Correa J, Khosrowshahi A, Lavers D, Kunkel K, Wehner M, Collins W (2016) Application of deep convolutional neural networks for detecting extreme weather in climate datasets. arXiv reprint arXiv:1605.01156

Maas AL, Hannun AY, Ng AN (2013) Paper presented at international conference on machine learning (ICML) 2013, Atlanta Marriott Marquis, Atlanta, 16–21 June 2013

Martins AFT, Astudillo RF (2016) From softmax to sparse-max: a sparse model of attention and multi-label classification. Paper presented at international conference on machine learning (ICML) 2016, Marriott Marquis hotel, New York City, 19–24 June 2016

Matsuoka D, Nakano M, Sugiyama D, Uchida S (2017) In: Ebert-Uphoff I, Monteleoni C, Nychka D (eds) Detecting precursors of tropical cyclone using deep neural networks. In: Proceedings of the 7th international workshop on climate informatics, Boulder 2017

Matsuoka D, Nakano M, Sugiyama D, Uchida S (2018) Deep learning approach for detecting tropical cyclones and their precursors in the simulation by a cloud-resolving global nonhydrostatic atmospheric model. Prog Earth Planet Sci. https://doi.org/10.1186/s40645-018-0245-y

Article
Google Scholar

Nair V, Hinton GE (2010) Rectified linear units improve restricted Boltzmann machines. Paper presented at international conference on machine learning (ICML) 2010, Haifa Congress Center, Haifa, 21–24 June 2010

Nakano M, Sawada M, Nasuno T, Satoh M (2015) Intraseasonal variability and tropical cyclogenesis in the Western North Pacific simulated by a global nonhydrostatic atmospheric model. Geophys Res Lett 42(2):565–571

Pradhan R, Aygun RS, Maskey M, Ramachandran R, Cecil DJ (2018) Tropical cyclone intensity estimation using a deep convolutional neural network. IEEE Trans Image Process 27(2):692–702

Rasp S, Dueben PD, Scher S, Weyn JA, Mouatadid S, Thuerey N (2020) WeatherBench: a benchmark data set for data-driven weather forecasting. J Adv Model Earth Syst. https://doi.org/10.1029/2020MS002203

Article
Google Scholar

Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–538

Article
Google Scholar

Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C (2018) MobileNetV2: Inverted residuals and linear bottlenecks. Paper presented at conference on computer vision and pattern recognition (CVPR) 2018, Calvin L. Rampton Salt Palace Convention Center, Salt Lake City, 18–22 June 2018

Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):60. https://doi.org/10.1186/s40537-019-0197-0

Article
Google Scholar

Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. Paper presented at international conference on learning representation (ICLR) 2015, The Hilton San Diego Resort & Spa, San Diego, 7–9 May 2015

Sugi M, Noda A, Sato N (2002) Influence of the global warming on tropical cyclone climatology: an experiment with the JMA global model. J Meteorol Soc Jpn 80(2):249–272

Article
Google Scholar

Sun Y, Wong AKC, Kamel MS (2009) Classification of imbalanced data: a review. Int J Pattern Recognit Artif Intell 23(04):687–719

Sun Y, Wang X, Tang X (2014) Deeply learned face representations are sparse, selective, and robust. Paper presented at conference on computer vision and pattern recognition (CVPR) 2015, Hynes Convention Center, Boston, 7–12 June 2015

Wei J, Suriawinata A, Vaickus L, Ren B, Liu X, Wei J, Hassanpour S (2020) Generative image translation for data augmentation in colorectal histopathology images. Paper at the thirty-third annual conference on neural information processing systems (NeurIPS) 2019, Vancouver Convention Center, Vancouver, 8–14 December 2019

Xu B, Wang N, Chen T, Li M (2015) Empirical evaluation of rectified activations in convolutional network. CoRR [Abs.]/1505.00853

Yamada Y, Satoh M, Sugi M, Kodama C, Noda AT, Nakano M, Nasuno T (2017) Response of tropical cyclone activity and structure to global warming in a high-resolution global nonhydrostatic model. J Clim 30(23):9703–9724

Article
Google Scholar

Zagoruyko S, Komodakis N (2016) Wide residual networks. Paper presented at British machine vision conference BMVC) 2016, York University, York, 19–22 September 2016

Zhang H, Cisse M, Dauphin YN, Lopez-Paz D (2018) mixup: beyond empirical risk minimization. Paper presented at international conference on learning representation (ICLR) 2018, Vancouver Convention Center, Vancouver. 30 April–3 May 2018

Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. Paper presented at Conference on Artificial Intelligence (AAAI) 2020, Hilton New York Midtown, New York, 7–12 July 2020