论文信息 - EdVidParse : detecting people and content in educational videos

EdVidParse : detecting people and content in educational videos

Thesis: M. Eng. in Computer Science and Engineering, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015.

Michele Pratusevich

[1] Krista A. Ehinger,et al. SUN Database: Exploring a Large Collection of Scene Categories , 2014, International Journal of Computer Vision.

[2] Bernt Schiele,et al. What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Kazuaki Kishida. Property of average precision and its generalization: An examination of evaluation indicator for information retrieval experiments , 2005 .

[4] Krzysztof Z. Gajos,et al. Understanding in-video dropouts and interaction peaks in online lecture videos Citation , 2014 .

[5] C. Lawrence Zitnick,et al. Structured Forests for Fast Edge Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[6] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[7] Björn Hartmann,et al. Video digests: a browsable, skimmable format for informational lecture videos , 2014, UIST.

[8] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[9] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[10] Sara T Itani. EduCase : an automated lecture video recording, post-processing, and viewing system that utilizes multimodal inputs to provide a dynamic student experience , 2013 .

[11] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[12] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[13] C. Lawrence Zitnick,et al. Fast Edge Detection Using Structured Forests , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15] Rebecca Krosnick,et al. VideoDoc : combining videos and lecture notes for a better learning experience , 2015 .

[16] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[17] Jasper Snoek,et al. Bayesian Optimization and Semiparametric Models with Applications to Assistive Technology , 2014 .

[18] Jasper Snoek,et al. Input Warping for Bayesian Optimization of Non-Stationary Functions , 2014, ICML.

[19] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[20] Krzysztof Z. Gajos,et al. Learnersourcing Subgoal Labels for How-to Videos Citation , 2014 .

[21] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[22] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23] Jasper Snoek,et al. Bayesian Optimization with Unknown Constraints , 2014, UAI.

[24] Dumitru Erhan,et al. Scalable Object Detection Using Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[26] Jasper Snoek,et al. Multi-Task Bayesian Optimization , 2013, NIPS.

[27] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.

[28] Patrick Jermann,et al. How Students Learn using MOOCs: An Eye-tracking Insight , 2014 .

[29] Harald Sack,et al. Lecture Video Indexing and Analysis Using Video OCR Technology , 2011, 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems.

[30] Richard Szeliski,et al. High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[31] Cordelia Schmid,et al. A Spatio-Temporal Descriptor Based on 3D-Gradients , 2008, BMVC.

[32] Kunihiko Fukushima,et al. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[33] John Domingue,et al. Using Linked Data to Annotate and Search Educational Video Resources for Supporting Distance Learning , 2012, IEEE Transactions on Learning Technologies.

[34] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[35] Philip J. Guo,et al. How video production affects student engagement: an empirical study of MOOC videos , 2014, L@S.

[36] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[37] Yuanyuan Wang,et al. An Exploratory Search for Presentation Contents based on Slide Semantic Structure , 2014, SEKE.

[38] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[39] Bolei Zhou,et al. Object Detectors Emerge in Deep Scene CNNs , 2014, ICLR.

[40] René F. Kizilcec,et al. Showing face in video instruction: effects on information retention, visual attention, and affect , 2014, CHI.

[41] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[42] Trevor Darrell,et al. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[43] Jitendra Malik,et al. Analyzing the Performance of Multilayer Neural Networks for Object Recognition , 2014, ECCV.

[44] John Adcock,et al. TalkMiner: a search engine for online lecture video , 2010, ACM Multimedia.

[45] Jasper Snoek,et al. Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[46] Tomaso A. Poggio,et al. Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[47] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[48] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[49] Sanjay Goel,et al. LectureKhoj: Automatic tagging and semantic segmentation of online lecture videos , 2014, 2014 Seventh International Conference on Contemporary Computing (IC3).

[50] Koen E. A. van de Sande,et al. Selective Search for Object Recognition , 2013, International Journal of Computer Vision.