The Handbook of NLP with Gensim: Leverage topic modeling to uncover hidden patterns, themes, and valuable insights within textual data

Chris Kuo

Elevate your natural language processing skills with Gensim and become proficient in handling a wide range of NLP tasks and projects

Key Features
  • Advance your NLP skills with this comprehensive guide covering detailed explanations and code practices
  • Build real-world topic modeling pipelines and fine-tune hyperparameters to deliver optimal results
  • Explore real-world industrial applications of topic modeling in medical, legal, and other fields
  • Purchase of the print or Kindle book includes a free PDF eBook
Book Description

Navigating the terrain of NLP research and applying it in practice can be a formidable task, one that The Handbook of NLP with Gensim makes easier. This book demystifies NLP and equips you with hands-on strategies spanning healthcare, e-commerce, finance, and more, enabling you to leverage Gensim in real-world scenarios.

You'll begin by exploring the motivation and techniques for turning text into numerical representations, such as bag-of-words, TF-IDF, and word embeddings. This book will then guide you through topic modeling using methods such as Latent Semantic Analysis (LSA) for dimensionality reduction and discovering latent semantic relationships in text data, Latent Dirichlet Allocation (LDA) for probabilistic topic modeling, and Ensemble LDA to enhance topic modeling stability and accuracy.
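To make the first two representations concrete, here is a minimal sketch in plain Python. It is not taken from the book and does not use Gensim: the function names `bag_of_words` and `tf_idf`, the whitespace tokenizer, and the two sample sentences are all assumptions made for illustration. The weighting shown (term frequency times the natural log of N over document frequency) is one common TF-IDF variant; Gensim's own classes may use different defaults.

```python
import math
from collections import Counter


def bag_of_words(docs):
    """Return one token-count Counter per document (lowercased, whitespace-split)."""
    return [Counter(doc.lower().split()) for doc in docs]


def tf_idf(docs):
    """Weight each term by tf * ln(N / df), one common TF-IDF variant (an assumption)."""
    bows = bag_of_words(docs)
    n_docs = len(bows)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for bow in bows:
        df.update(bow.keys())

    weights = []
    for bow in bows:
        total = sum(bow.values())  # number of tokens in this document
        weights.append(
            {term: (count / total) * math.log(n_docs / df[term])
             for term, count in bow.items()}
        )
    return weights


# Hypothetical sample documents, chosen only for illustration.
docs = [
    "the patient reported chest pain",
    "the court dismissed the claim",
]
bows = bag_of_words(docs)
weights = tf_idf(docs)
```

In this sketch, a word that appears in every document (here, "the") receives a weight of zero, while a word found in only one document keeps a positive weight. That is the intuition behind TF-IDF: it downweights terms that do not help tell documents apart.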

Next, you'll learn text summarization techniques with Word2Vec and Doc2Vec, build the modeling pipeline, and optimize models by tuning hyperparameters. As you get acquainted with practical applications in various industries, this book will inspire you to design innovative projects. Alongside topic modeling, you'll also explore named entity recognition (NER) tools, modeling procedures, and other tools for effective topic modeling applications.

By the end of this book, you'll have mastered the techniques essential to create applications with Gensim and integrate NLP into your business processes.

What you will learn
  • Convert text into numerical representations such as bag-of-words, TF-IDF, and word embeddings
  • Use various NLP techniques with Gensim, including Word2Vec, Doc2Vec, LSA, FastText, LDA, and Ensemble LDA
  • Build topic modeling pipelines and visualize the results of topic models
  • Implement text summarization for legal, clinical, or other documents
  • Apply core NLP techniques in healthcare, finance, and e-commerce
  • Create efficient chatbots by harnessing Gensim's NLP capabilities
Who this book is for

This book is for data scientists and professionals who want to become proficient in topic modeling with Gensim. NLP practitioners can use this book as a code reference, while students or those considering a career transition will find it a valuable resource for advancing in the field of NLP. This book contains real-world applications in biomedical, healthcare, legal, and operations settings, making it a helpful guide for project managers designing their own topic modeling applications.

Table of Contents
  1. Introduction to NLP
  2. Word Embedding
  3. Text Wrangling and Preprocessing
  4. Latent Semantic Analysis with scikit-learn
  5. Cosine Similarity
  6. Latent Semantic Indexing with Gensim
  7. Using Word2Vec
  8. Doc2Vec with Gensim
  9. Understanding Discrete Distributions
  10. Latent Dirichlet Allocation
  11. LDA Modeling
  12. LDA Visualization
  13. The Ensemble LDA for Model Stability
  14. LDA and BERTopic
  15. Real-World Use Cases

Book Details

  • Publisher: Packt Publishing
  • Publish Date: Oct 27th, 2023
  • Pages: 310
  • Language: English
  • Dimensions: 9.25 in x 7.50 in x 0.65 in
  • Weight: 1.18 lb
  • EAN: 9781803244945
  • Categories: Computer Science, Data Science - General, Artificial Intelligence - General

About the Author

Chris Kuo is a data scientist with over 23 years of experience. He has led data science solutions in customer analytics, health analytics, fraud detection, and litigation. He is also an inventor of a U.S. patent. He has worked at several Fortune 500 companies in the insurance and retail industries.

Chris teaches at Columbia University and has taught at Boston University and other universities. He has published articles in economic and management journals and served as a journal reviewer. He is the author of The eXplainable A.I., Modern Time Series Anomaly Detection, Transfer Learning for Image Classification, and The Handbook of Anomaly Detection. He received his undergraduate degree in Nuclear Engineering and his Ph.D. in Economics.