Mining naturalistic human behaviors in long-term video and neural recordings