Weakly-Supervised Temporal Localization via Occurrence Count Learning
暂无分享,去创建一个
A. David Marshall | Julien Schroeter | Kirill A. Sidorov | A. D. Marshall | J. Schroeter | D. Marshall | K. Sidorov
[1] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[2] E. B. Newman,et al. A Scale for the Measurement of the Psychological Magnitude Pitch , 1937 .
[3] Jun S. Liu,et al. Weighted finite population sampling to maximize entropy , 1994 .
[4] Gerhard Widmer,et al. On the Potential of Simple Framewise Approaches to Piano Transcription , 2016, ISMIR.
[5] Bohyung Han,et al. Weakly Supervised Action Localization by Sparse Temporal Pooling Network , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[6] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[7] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[8] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .
[10] Juan Carlos Niebles,et al. Unsupervised Learning of Human Action Categories , 2006 .
[11] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[12] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[13] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[14] Jun S. Liu,et al. STATISTICAL APPLICATIONS OF THE POISSON-BINOMIAL AND CONDITIONAL BERNOULLI DISTRIBUTIONS , 1997 .
[15] Alex Graves,et al. Recurrent Models of Visual Attention , 2014, NIPS.
[16] Juan Carlos Niebles,et al. Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2008, International Journal of Computer Vision.
[17] Yong Xu,et al. Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Yong Xu,et al. A joint detection-classification model for audio tagging of weakly labelled data , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] M. Fernandez,et al. Closed-Form Expression for the Poisson-Binomial Probability Density Function , 2010, IEEE Transactions on Aerospace and Electronic Systems.
[20] V. V. Petrov. On lower bounds for tail probabilities , 2007 .
[21] Daniel Gärtner,et al. Real-Time Transcription and Separation of Drum Recordings Based on NMF Decomposition , 2014, DAFx.
[22] Gerhard Widmer,et al. A Review of Automatic Drum Transcription , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[23] Yi-Hsuan Yang,et al. Event Localization in Music Auto-tagging , 2016, ACM Multimedia.
[24] Cordelia Schmid,et al. Weakly Supervised Action Labeling in Videos under Ordering Constraints , 2014, ECCV.
[25] Honglak Lee,et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.
[26] D. Hilbert. Ueber die stetige Abbildung einer Line auf ein Flächenstück , 1891 .
[27] Roland Badeau,et al. Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[28] Carsten Rother,et al. Weakly supervised discriminative localization and classification: a joint learning process , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[29] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[30] Tuomas Virtanen,et al. Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network , 2017, ArXiv.
[31] Jason Hockman,et al. Automatic Drum Transcription Using Bi-Directional Recurrent Neural Networks , 2016, ISMIR.
[32] L. L. Cam,et al. An approximation theorem for the Poisson binomial distribution. , 1960 .
[33] Bhiksha Raj,et al. Audio Event Detection using Weakly Labeled Data , 2016, ACM Multimedia.
[34] M. Gail,et al. Likelihood calculations for matched case-control studies and survival studies with tied death times , 1981 .
[35] Peter Knees,et al. Drum transcription from polyphonic music with recurrent neural networks , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[36] Kyogu Lee,et al. Ensemble of Convolutional Neural Networks for Weakly-supervised Sound Event Detection Using Multiple Scale Input , 2017, DCASE.
[37] Gaël Richard,et al. ENST-Drums: an extensive audio-visual database for drum signals processing , 2006, ISMIR.
[38] Jean Ponce,et al. Automatic annotation of human actions in video , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[39] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[40] Yong Jae Lee,et al. Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-Supervised Object and Action Localization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[41] B. Roos. Binomial Approximation to the Poisson Binomial Distribution: The Krawtchouk Expansion , 2001 .
[42] Norman Breslow,et al. Discussion of Professor Cox''s paper , 1974 .
[43] G. Peano. Sur une courbe, qui remplit toute une aire plane , 1890 .
[44] Peter Knees,et al. Recurrent Neural Networks for Drum Transcription , 2016, ISMIR.
[45] Alexei A. Efros,et al. Unsupervised Visual Representation Learning by Context Prediction , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[46] Juergen Gall,et al. Weakly Supervised Action Learning with RNN Based Fine-to-Coarse Modeling , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Lei Zhang,et al. AutoLoc: Weakly-supervised Temporal Action Localization , 2018, ECCV.
[48] Yi Yang,et al. DevNet: A Deep Event Network for multimedia event detection and evidence recounting , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[50] Jason Hockman,et al. Automatic Drum Transcription for Polyphonic Recordings Using Soft Attention Mechanisms and Convolutional Neural Networks , 2017, ISMIR.
[51] Juan Carlos Niebles,et al. Connectionist Temporal Modeling for Weakly Supervised Action Labeling , 2016, ECCV.
[52] Trevor Hastie,et al. The Elements of Statistical Learning , 2001 .
[53] M. Verleysen,et al. Classification in the Presence of Label Noise: A Survey , 2014, IEEE Transactions on Neural Networks and Learning Systems.
[54] Colin Raffel,et al. Onsets and Frames: Dual-Objective Piano Transcription , 2017, ISMIR.
[55] Luc Van Gool,et al. UntrimmedNets for Weakly Supervised Action Recognition and Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[56] Qiang Huang,et al. Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging , 2017, INTERSPEECH.
[57] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[58] Juan Carlos Niebles,et al. Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.
[59] Jürgen Schmidhuber,et al. Deep learning in neural networks: An overview , 2014, Neural Networks.
[60] Simon Dixon,et al. An End-to-End Neural Network for Polyphonic Piano Music Transcription , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[61] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..
[62] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[63] Apostol Natsev,et al. YouTube-8M: A Large-Scale Video Classification Benchmark , 2016, ArXiv.
[64] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.
[65] D. Hilbert. Über die stetige Abbildung einer Linie auf ein Flächenstück , 1935 .
[66] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[67] R. A. Leibler,et al. On Information and Sufficiency , 1951 .
[68] Andrea Vedaldi,et al. Weakly Supervised Deep Detection Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).