Research Scientist Intern, GenAI - Multimodal Audio (Speech, Sound and Music)
Meta

Menlo Park, California

$0.00 - $100.00 per hour


Job Info


Meta was built to help people connect and share, and over the last decade our tools have played a critical part in changing how people around the world communicate with one another. With over a billion people using the service and more than fifty offices around the globe, a career at Meta offers countless ways to make an impact in a fast-growing organization.

We are committed to advancing the field of artificial intelligence by making fundamental advances in technologies to help interact with and understand our world. We are seeking individuals passionate about areas such as deep learning, computer vision, audio and speech processing, natural language processing, machine learning, reinforcement learning, computational statistics, and applied mathematics. Our interns have an opportunity to make core algorithmic advances and apply their ideas at an unprecedented scale.

The GenAI org at Meta builds industry-leading LLM and multimodal generative foundation models, which set the industry benchmark for open source foundation models and enable many Meta products.

The hiring team conducts industry-leading research on multimodal generative foundation models with a focus on the audio modality (including speech, sound and music). The team works closely with the language and vision research teams, and collaborates with product teams to bring the results to billions of Meta users around the world.

Research Scientist Intern, GenAI - Multimodal Audio (Speech, Sound and Music) Responsibilities:

  • Full-life-cycle research on multimodal generative foundation models with a focus on the audio modality, including generating ideas, designing and implementing models and algorithms, curating training data, training, tuning, and scaling the models, evaluating performance, open-sourcing, and publication
  • Develop novel state-of-the-art machine learning algorithms and corresponding systems, leveraging various deep learning techniques
  • Analyze and improve efficiency, scalability, and stability of corresponding deployed algorithms
  • Perform research to advance the science and technology of intelligent machines
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development


Minimum Qualifications:

  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Machine Learning, Artificial Intelligence, Robotics, Algorithms, Computational Mathematics, or relevant technical field.
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment.
  • Research experience in machine learning, deep learning, computer vision and/or natural language processing.
  • Experience with Python, C++, C, Lua or another related language.
  • Experience with deep learning frameworks such as PyTorch or TensorFlow


Preferred Qualifications:

  • Intent to return to degree program after the completion of the internship/co-op
  • Experience in either audio dataset curation or audio generation model evaluation
  • Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as first-authored publications at leading workshops or conferences such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ACL or similar
  • Experience working and communicating cross-functionally in a team environment
  • Publications or experience in audio (speech, sound, or music) or vision (image or video) generative models.
  • Experience solving analytical problems using quantitative approaches.
  • Experience setting up ML experiments and analyzing their results.
  • Experience manipulating and analyzing complex, large-scale, high-dimensionality data from varying sources
  • Experience in utilizing theoretical and empirical research to solve problems.


About Meta:

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today: beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.


