AI101 - Hands-on

    Text

  1. Google Translate , How It Works
  2. Traditional Natural Language Processing Rosette , OnToText , Deep Pavlov , Epsilon Sys , HuggingFace , Allen NLP , Stanford NLP , Explosion.AI - spaCy NER , Dandelion NER , How It Works
  3. Google Knowledge Graph , 101 , How It Works , 2
  4. Large Language Models ChatGPT , Perplexity.AI , Perplexity.AI Labs Playground , Google AI Studio , Google NotebookLM , Google Gemini , Anthropic Claude , Bing Chat , DeepSeek v3 , Lambda Chat , Qwen2-72B-Instruct , How It Works , 2 , PromptBook
  5. Write With Transformer , How It Works , 2 , 3 , 4 (BERT) , How It Works
  6. Text to Image MagicQuill , Google Labs - Image-FX , Tencent - Flux-Mini , Stability.AI , Clipdrop - Text to Image , PromptHero - Landscape Prompts , Freepik , Vecteezy , Google Gemini , Freepik - AI Image Generator , Nvidia - Sana , How It Works (Annotated) , 2 , List of tools for creating prompts for AI text-to-image generators
  7. Image to Image PhotoMaker , PhotoMaker , How It Works , 2
  8. Text to Music Text-to-Music samples with prompts , music audio generators
  9. Text To Video Runway.ML Gen-2 , Haiper , Sample Footages , How It Works , Text to Audio/Image/Video story-telling

    Image

  10. Image-2-Text - Moondream (Visual Q&A, Caption, Object Detection)
  11. HuggingFace - SmolVLM-500M
  12. Alibaba Qwen Chat
  13. Image-2-Text - Microsoft Florence 2 (Caption)
  14. Image-2-Text - AllenAI MolMo
  15. VideoLLaMA 3: Frontier Multimodal Foundation Models for Video Understanding , Video Understanding
  16. Segment-Anything , How It Works , 2 , 3
  17. Recognize Anything , How It Works
  18. LLaVA: Large Language and Vision Assistant , How It Works
  19. Image - Segmentation

  20. Hugging Face Group Face Detection , Emotion Recognition , ViTPose , Image SuperResolution , Students studying , Image Restoration - How It Works

  21. Synthetic Data (to test Fake News Detection algorithms) GAN - This Person Does Not Exist , GAN - This X Does Not Exist , How It Works , 2
  22. Single Digit Handwriting
  23. Stable Doodle
  24. Google Labs Whisk (not available in SG)
  25. AutoDraw , Google Quickdraw , How It Works , 2
  26. Remove Image Background removal.ai , remove.bg
  27. Anime GAN
  28. Floorplan (Stanislas Chaillou)
  29. StreetCLIP

    Video (requires WebCam)

  30. PoseNet , BodyPix , Pose based Video Retrieval , How It Works , MeshCapade - How It Works
  31. 3D PoseNet , demo 2 , How It Works , 2
  32. Google Creative Lab - PoseNet
  33. Pose Animator
  34. TREX-2 - Object Counting
  35. Google MediaPipe mediapipe , MediaPipe FaceMesh Tracking , MediaPipe Hand Tracking , HandTrack , How It Works , 2 , 3 , 4 , mediapipe-for-dummies , hand tracking code

    Audio - Speech / Music

  36. Text-To-Speech Hume.AI , Hume.AI - TikTok Fashion Influencer , ElevenLabs , Microsoft - EdgeTTS , Kokoro-TTS , ChatTTS + OpenVoice , Play.AI - 2 Speakers , FreePik - voice-generator , How It Works 1 , Speechify , Natural Readers , How It Works 2
  37. Voice Cloning llasa-3b-tts , CoQui XTTS , Sample 1 , Sample 2
  38. Text-To-Song Mureka.AI , Music.FX , Jammable , Suno , Tensorflow - Magenta , StableAudio , How It Works
  39. Text-To-Sound Effects ElevenLabs
  40. Yamaha Vocaloid
  41. AI Cover Maroon 5: Memories SpongeBob , Sun YanZi - Mayday , Sun YanZi - Love Story

    3D

  42. StabilityAI/stable-point-aware-3d (SPAR3D) , How It Works
  43. StabilityAI/stable-fast-3d (2D image to 3D)
  44. Tencent - InstantMesh , Tencent - Hunyuan3D-2
  45. 2D-3D Microsoft TRELLIS

    Reading Material

  46. DSTA Medium Page , AI Generated Content MindMap
  47. Machine Learning for Everyone
  48. Deep Learning Book
  49. Stanford CS229 - Machine Learning Course , CS231n: Convolutional Neural Networks for Visual Recognition
  50. CV Detailed Guide to Understand and Implement ResNets , Start Here with Computer Vision, Deep Learning, and OpenCV
  51. Google Machine Learning Crash Course
  52. Machine Learning Systems Design
  53. Machine Learning Visual Notes
  54. Stanford Speech and Language Processing
  55. Tools and Resources for AI Art
  56. Awesome AI/ML
  57. Yann LeCun , Deep Learning Hardware: Past, Present, and Future

    Hands-on

  58. Google machine-learning crash-course
  59. 100 Days of Machine Learning Coding
  60. AI in the Browser
  61. PyImageSearch - Python & CV
  62. Google AI Experiments
  63. Google TeachableMachine
  64. Knowledge Graph - Mine Information from Text
  65. Fast.AI Practical Deep Learning for Coders , Book
  66. Information Visualization
  67. Image To Image , Tensorflow Pix2Pix , How It Works
  68. creativitywith.ai
  69. Clipdrop - Stable Diffusion
  70. Stable Diffusion - Cloud-Services BaseTen , RunComfy , Replicate , RunDiffusion , RunPod , ThinkDiffusion
  71. Stable Diffusion - ComfyUI OpenArt.AI Templates , OpenArt.AI Contest-Winners , OpenArt.AI Templates

    Others

  72. Meta AI demos
  73. Digital Humans HuggingFace SadTalker , HeyGen , synthesia.io , D-ID Chat , DeepMotion , Leonardo.AI , shakker.AI , Unreal Metahuman , How It Works , 2 , 3 , LivePortrait , LivePortrait Demo 1 , LivePortrait Demo 2
  74. prodi.gy named-entity-recognition
  75. ML5js
  76. Oxford - Visual Geometry Group
  77. GAN Lab
  78. Machine Learning Canvas
  79. Artificial Intelligence for Humanitarian Assistance and Disaster Response (HADR) Workshop @ NeurIPS 2019
  80. TensorFlow JS
  81. 3D Mapping - Drone Skydio (SketchFab), Birmingham (SketchFab)
  82. Docubase Tools
  83. 2D photo to 3D NeRF in the Wild - Neural Radiance Fields for Unconstrained Photo Collections
  84. MLOps Community
  85. Google Maps 3D , 2
  86. Google Street View , 2
  87. HTML Colors HTML Color Codes

    Videos

  88. Re-Work
  89. Two Minute Papers

    Datasets

  90. Exploring 12 Million of the 2.3 Billion Images Used to Train Stable Diffusion's Image Generator , Exploring the training data behind Stable Diffusion
  91. DataSetList Machine Learning
  92. Google Dataset Search
  93. AMiner Datasets
  94. CV - Event Recognition in Aerial Videos
  95. Benchmarking Measuring the Progress of AI Research , Github , Chatbot Arena , Artificial Analysis
  96. Synthetic Data MakeSense.AI (label photos)
  97. Microsoft Designer (sign-in with Microsoft account)
  98. Sample Sounds Google AudioSet , FreeSound , LAION Audio Dataset Project , VGG-Sound , AudioCaps , WavCaps
  99. Sample Text Chinese News - ZaoBao
  100. Sample Video FreePik - People , FreePik - Nature , PictoGraphic , UnSplash , PixaBay , Pexels , StockSnap.io , PicJumbo , Flickr , Adobe Stock Collection , Shopify Burst
  101. Sample Videos for PoseNet Dance Videos for PoseNet 1 , Dance Videos for PoseNet 2 , BTS Fire , GD X TAEYANG 'Good Boy' mirrored Dance Practice , Meghan Trainor - NO / Learner's Class , Maroon 5 - Memories / Woomin Jang Choreography for PoseNet 3