project303/Having-Fun-with-NLP

Having Fun with NLP

Text Mining 301

  1. Lab01 - Installation and Config
  2. Lab02 - Tokenization and Cleansing
  3. Lab03 - Stop Words Removal
  4. Lab04 - Stemming
  5. Lab05 - TF-IDF
  6. Lab06 - Text Clustering - Putting It All Together
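The lab sequence above describes a typical text-mining pipeline. As a rough illustration only (the lab notebooks themselves are not reproduced here, so every function name and the tiny stop-word list below are hypothetical), a minimal standard-library sketch of tokenization, stop-word removal, and TF-IDF weighting might look like this. Stemming and clustering are left out because they normally rely on a dedicated library.

```python
import math
import re
from collections import Counter

# Hypothetical, very small Indonesian stop-word list for illustration only;
# the labs may use a different and much larger list.
STOP_WORDS = {"yang", "dan", "di", "ke", "dari", "ini", "itu"}


def tokenize(text):
    """Lowercase the text and keep only runs of ASCII letters."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def preprocess(text):
    """Tokenize, then remove stop words."""
    return remove_stop_words(tokenize(text))


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Uses the plain textbook weighting tf * log(N / df), where tf is the
    term's relative frequency in the document, N is the number of
    documents, and df is the number of documents containing the term.
    """
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return weights


if __name__ == "__main__":
    corpus = [
        "Saya suka belajar NLP dan Python",
        "Belajar NLP itu menyenangkan",
    ]
    for doc_weights in tf_idf([preprocess(d) for d in corpus]):
        print(doc_weights)
```

Terms that occur in every document (here "belajar" and "nlp") get a weight of zero under this formula, which is why libraries often apply a smoothed variant instead.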

Presentation link:

  1. Presentation

Resources for Indonesian NLP

  1. wortschatz.uni-leipzig.de - Collection of news and articles from various news sites and Wikipedia, 2008-2016
  2. tanzil.net - Translations of the Quran
  3. ilps.science.uva.nl - Corpus of online news and articles from Kompas and Tempo, 2001-2002
  4. Opus NLPL - Collection of translated texts in various languages
  5. Rio Chandra - Sentiment analysis dataset
  6. Sastrawi - Word lists used by Sastrawi
  7. Rama Prakoso - Word list for determining sentiment
  8. Prasasto Adi - Word list for determining sentiment
  9. Rio Chandra - Word list for determining sentiment
  10. Rama Prakoso - List of commonly used abbreviations
  11. Yohanes Gultom - Indonesian POS tagging and NER
  12. Yusuf Syaifudin - Indonesian POS tagging and NER
  13. Reza Dwi Utomo - spaCy for training NER models using Indonesian annotations
  14. Fam Rashel - Indonesian POS tagging
  15. Louis Owen - Curated collection of Indonesian NLP resources
  16. Ismail Fahmi - Inside Drone Emprit: Natural Processing, Sentiment Analysis, Emotion Analysis
  17. Hate Speech Data
  18. The Seven Practice Areas of Text Analytics - The seven text mining practice areas exist at the major intersections of text mining with its six related fields

Free Course

  1. Free LLM Course - Free LLM Courses

YouTube Videos for Learning LLMs

  1. Intro to Large Language Models - Andrej Karpathy
  2. Attention Is All You Need - Yannic Kilcher
  3. Stanford CS224N NLP with Deep Learning - Stanford Online
  4. Non-Technical Intro to Generative AI - freeCodeCamp.org

Awesome NLP and LLM Datasets

  1. LLM DataHub - Datasets for LLM training
  2. Awesome LLMs Datasets - LLM datasets and research papers
  3. LLM Datasets - High-quality datasets, tools, and concepts for LLM fine-tuning
  4. Awesome Public Datasets - A high-quality list of topic-centric public data sources
  5. NLP Datasets - Alphabetical list of free/public domain datasets with text data for use in NLP
  6. Awesome Dataset Tools - A curated list of awesome dataset tools
  7. Awesome time series database - A curated list of time series databases
  8. Awesome Cybersecurity Datasets - A curated list of amazingly awesome Cybersecurity datasets
  9. Awesome Robotics Datasets - Robotics dataset collections

About

A place to learn NLP
