<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Decoding AI Jargons with Chai]]></title><description><![CDATA[Decoding AI Jargons with Chai]]></description><link>https://decoding-the-jargons-of-ai-with-chai.hashnode.dev</link><generator>RSS for Node</generator><lastBuildDate>Fri, 26 Jun 2026 05:46:01 GMT</lastBuildDate><atom:link href="https://decoding-the-jargons-of-ai-with-chai.hashnode.dev/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Decoding AI Jargons with Chai]]></title><description><![CDATA[Ever wondered how ChatGPT or other AI tools magically understand what you're typing and respond almost like a human? It's not magic—it's math, models, and a lot of machine learning. One of the most important building blocks behind these tools is a sp...]]></description><link>https://decoding-the-jargons-of-ai-with-chai.hashnode.dev/decoding-ai-jargons-with-chai</link><guid isPermaLink="true">https://decoding-the-jargons-of-ai-with-chai.hashnode.dev/decoding-ai-jargons-with-chai</guid><category><![CDATA[Generative Pre Trained Transformer]]></category><category><![CDATA[GenAI Cohort]]></category><category><![CDATA[AI Jargon]]></category><category><![CDATA[ChaiCode]]></category><dc:creator><![CDATA[Rahul Kapoor]]></dc:creator><pubDate>Tue, 08 Apr 2025 05:28:49 GMT</pubDate><content:encoded><![CDATA[<p>Ever wondered how ChatGPT or other AI tools magically understand what you're typing and respond almost like a human? It's not magic—it's <strong>math, models, and a lot of machine learning</strong>. One of the most important building blocks behind these tools is a special model called a <strong>Transformer</strong>.</p>
<p>Let’s break it down in simple terms.</p>
<hr />
<h2 id="heading-what-is-a-transformer">🔄 What Is a Transformer?</h2>
<p>A <strong>Transformer</strong> is a type of model that reads and understands <strong>entire sentences all at once</strong>, rather than going word by word like earlier models did.</p>
<p>Imagine reading a whole paragraph before trying to answer a question about it. That’s what makes Transformers powerful—they understand <strong>context as a whole</strong>, not just in bits and pieces.</p>
<hr />
<h2 id="heading-step-1-tokenization-making-language-machine-readable">🧩 Step 1: Tokenization (Making Language Machine-Readable)</h2>
<p>When you type something like:</p>
<blockquote>
<p>“What’s the weather like in Delhi today?”</p>
</blockquote>
<p>An AI doesn’t understand this directly. The first step is to convert your sentence into <strong>tokens</strong>—small chunks of text that the machine can understand.</p>
<p>This process is called <strong>tokenization</strong>, and it’s done by a component called the <strong>encoder</strong>.</p>
<ul>
<li><p>Think of tokens as words or even parts of words, depending on the model.</p>
</li>
<li><p>These tokens are <strong>specific</strong> to the AI model you're using.</p>
</li>
<li><p>Different models = different token definitions.</p>
</li>
</ul>
<p>Once tokenized, these tokens are converted into <strong>vector embeddings</strong>—basically, numerical representations of words. These vectors capture the <strong>semantic meaning</strong> of the prompt.</p>
<hr />
<h2 id="heading-but-wait-what-about-word-order">🧭 But Wait! What About Word Order?</h2>
<p>Here’s a catch:<br />Two different sentences like:</p>
<ul>
<li><p>“The cat chased the dog.”</p>
</li>
<li><p>“The dog chased the cat.”</p>
</li>
</ul>
<p>...would generate <strong>similar tokens and vector embeddings</strong>, even though the meaning is <strong>completely different</strong>.</p>
<p>That’s where <strong>positional encoding</strong> comes in. It helps the model <strong>understand the order of words</strong>, ensuring that:</p>
<ul>
<li>“The cat chased the dog” ≠ “The dog chased the cat.”</li>
</ul>
<p>It’s like giving each word a GPS coordinate so the AI knows <em>where</em> each word sits in the sentence.</p>
<hr />
<h2 id="heading-self-attention-letting-words-talk-to-each-other">🧠 Self-Attention: Letting Words "Talk" to Each Other</h2>
<p>Words often change their meaning based on context. For example:</p>
<ul>
<li><p>"Apple" in “I ate an apple” = fruit</p>
</li>
<li><p>"Apple" in “Apple released a new iPhone” = company</p>
</li>
</ul>
<p>Even though the token is the same, the <strong>meaning is not</strong>.</p>
<p>That’s why we use something called a <strong>self-attention model</strong>, where tokens <strong>interact with each other</strong>. This helps them adjust their vector embeddings based on their surroundings—just like how humans interpret meaning from context.</p>
<p>To make this even more effective, models use a <strong>multi-head attention</strong> mechanism. It’s like having multiple “perspectives” or “views” on the same sentence to understand it deeply at different layers.</p>
<hr />
<h2 id="heading-enter-gpt-predicting-the-next-character">🔄 Enter GPT: Predicting the Next Character</h2>
<p>At the heart of models like ChatGPT (which stands for <strong>Generative Pre-Trained Transformer</strong>) is a simple goal:</p>
<blockquote>
<p><strong>Predict the next character or word</strong> you're about to type.</p>
</blockquote>
<p>But don’t mistake “simple” for “easy.”<br />GPT does this by being trained on <strong>massive amounts of data</strong>, learning how language works based on context.</p>
<p>Here’s how the process works:</p>
<ol>
<li><p>After <strong>tokenization</strong> and <strong>positional encoding</strong>, your prompt goes into the <strong>multi-head attention model</strong>.</p>
</li>
<li><p>It runs <strong>multiple rounds</strong> of attention to refine its understanding.</p>
</li>
<li><p>The result is passed through a <strong>linear function</strong>, which calculates the <strong>probability of each possible next token</strong>.</p>
</li>
<li><p>These probabilities go into the <strong>SoftMax function</strong>, which picks the most likely next token.</p>
</li>
</ol>
<hr />
<h2 id="heading-what-does-temperature-do">🔥 What Does Temperature Do?</h2>
<p>Ever noticed how AI can either give very <strong>to-the-point</strong> or <strong>creative</strong> answers?</p>
<p>That depends on the <strong>temperature</strong> setting.</p>
<ul>
<li><p><strong>Low temperature</strong> → chooses high-probability tokens → short, factual responses</p>
</li>
<li><p><strong>High temperature</strong> → explores less likely tokens → more creative, diverse responses</p>
</li>
</ul>
<p>It’s kind of like adjusting the “spice level” in your food—more spice (temperature), more variety!</p>
<hr />
<h2 id="heading-training-vs-inference-two-phases-of-an-ai-model">🧪 Training vs. Inference: Two Phases of an AI Model</h2>
<p>Every AI model has <strong>two phases</strong>:</p>
<ol>
<li><p><strong>Training Phase</strong></p>
<ul>
<li><p>This is when the model learns patterns from huge datasets.</p>
</li>
<li><p>During training, the prompt goes through the multi-head attention system <strong>again and again</strong>, refining its predictions until it gets better and better.</p>
</li>
</ul>
</li>
<li><p><strong>Inference Phase</strong></p>
<ul>
<li><p>This is when <em>you</em> use the model.</p>
</li>
<li><p>By this point, the model already knows how to process language and just applies what it has learned.</p>
</li>
</ul>
</li>
</ol>
<hr />
<h2 id="heading-but-what-about-real-time-info-like-weather">🌐 But What About Real-Time Info Like Weather?</h2>
<p>Here’s a fun fact:<br />Most large language models (LLMs) like GPT <strong>can’t access live data</strong> like the current weather—<strong>unless</strong> they are specifically trained on it or connected to tools that can.</p>
<ul>
<li><p>This training process is expensive and usually happens <strong>once or twice a year</strong>.</p>
</li>
<li><p>The last time data was fed into the model is called its <strong>knowledge cutoff date</strong>.</p>
</li>
</ul>
<p>So if you ask an AI:</p>
<blockquote>
<p>“What’s the weather in Delhi today?”</p>
</blockquote>
<p>...unless it's connected to real-time data, it won’t know. But if you trained it on weather data from yesterday, it could answer <strong>about yesterday’s weather</strong> just fine.</p>
<hr />
<h2 id="heading-whats-vocab-size">📚 What's Vocab Size?</h2>
<p>The <strong>vocab size</strong> of a model refers to how many <strong>unique tokens</strong> it can recognize and generate.</p>
<p>A bigger vocab = more language variety<br />A smaller vocab = more limitations in expression</p>
<hr />
<h2 id="heading-final-step-decoding-the-response">🔚 Final Step: Decoding the Response</h2>
<p>Once the model picks the best token using SoftMax, that token is still <strong>just a number</strong>.</p>
<p>To turn that number into something humans can read, it goes through a <strong>decoder</strong>—a machine that converts tokens back into words, sentences, and paragraphs.</p>
<hr />
<h2 id="heading-bringing-it-all-together">🎉 Bringing It All Together</h2>
<p>Here's a quick summary of how your words turn into a smart AI response:</p>
<ol>
<li><p>You type something → gets tokenized by the <strong>encoder</strong></p>
</li>
<li><p>Tokens → converted into <strong>vector embeddings</strong></p>
</li>
<li><p>Embeddings → processed with <strong>positional encoding</strong></p>
</li>
<li><p>Tokens talk to each other via <strong>self-attention</strong></p>
</li>
<li><p>Multiple layers of <strong>multi-head attention</strong> refine the context</p>
</li>
<li><p>A <strong>linear function</strong> calculates the next likely token</p>
</li>
<li><p><strong>SoftMax</strong> chooses the best one based on <strong>temperature</strong></p>
</li>
<li><p>Final token → converted back into text via the <strong>decoder</strong></p>
</li>
<li><p>You get a human-like response. Boom. 💥</p>
</li>
</ol>
<hr />
<p>AI may seem magical on the surface, but underneath, it’s a clever symphony of math, language patterns, and training.</p>
<p>Next time you chat with an AI, remember: it’s a Transformer doing some serious brainwork in milliseconds. ⚡</p>
]]></content:encoded></item></channel></rss>