<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>1. Understanding LLMs &amp; Text Generation | GenAI Learning</title>
    <link>https://genai.gitpull.in/3-llm-and-text-gen/index.html</link>
    <description>How LLMs Generate Text LLMs don’t “think” like humans. They predict the most probable next word (token) based on previous words.&#xA;Step 1: Convert Text to Tokens Example (Word-based tokenization):&#xA;Sentence: &#34;The cat sat on the mat.&#34; Tokens: [&#34;The&#34;, &#34;cat&#34;, &#34;sat&#34;, &#34;on&#34;, &#34;the&#34;, &#34;mat&#34;, &#34;.&#34;] Example (Sub-word tokenization, used in LLaMA models):&#xA;Sentence: &#34;Artificial intelligence&#34; Tokens: [&#34;Art&#34;, &#34;ificial&#34;, &#34;intelli&#34;, &#34;gence&#34;] Why sub-word tokenization?&#xA;Handles new words by breaking them into smaller known parts. Reduces vocabulary size, improving efficiency. Step 2: Assign Probability to Next Token Example: Predicting the next token for the phrase: &#34;The capital of France is&#34;</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <atom:link href="https://genai.gitpull.in/3-llm-and-text-gen/index.xml" rel="self" type="application/rss+xml" />
  </channel>
</rss>