It happens to most of us multiple times a week: someone emails asking for a good time to meet, and before you know it, you're stuck in a back-and-forth scheduling spiral. It's a mundane friction that adds up. Four weekends ago, I decided to vibe-code my way to a solution. The result? A fragile but functional AI-native assistant I called Munshi (Hindi for secretary, pronounced moon-she). You can...
Think about local minima in thousands of dimensions
When I first learned about Gradient Descent about two years ago, I pictured it in the most obvious 3D way: two input variables as the x and y axes of a plane, with the loss as the third (z) axis. As for 'local minima', I imagined the model getting stuck in a "false bottom" of this bowl-shaped landscape, unable to reach the true minimum, the lowest point. But this...
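The "false bottom" idea can be sketched in one dimension. The function below is a made-up example chosen only because it has two valleys; depending on where it starts, gradient descent settles in either one.

```python
# 1-D sketch of a "false bottom": f(x) = x**4 - 3*x**2 + x has two valleys.
def grad(x):
    # derivative of f: f'(x) = 4x^3 - 6x + 1
    return 4 * x**3 - 6 * x + 1

def descend(x, lr=0.01, steps=2000):
    # repeatedly step downhill, against the gradient
    for _ in range(steps):
        x -= lr * grad(x)
    return x

print(descend(1.0))    # settles near x ≈ 1.13  (the shallower, "false" bottom)
print(descend(-1.0))   # settles near x ≈ -1.30 (the true minimum)
```

The starting point alone decides which valley you end up in; neither run ever "sees" the other minimum.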
Chain Rule of Calculus
Chain Rule = how you take derivatives when a value depends on another value, which itself depends on another value (i.e. a composition of functions). Intuition: if A affects B and B affects C, then A affects C through B. The chain rule just says: total sensitivity of the whole chain = (how sensitive C is to B) × (how sensitive B is to A). A neural network is a long chain of computations...
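The A → B → C intuition can be checked numerically. The functions below are made up for illustration (B = 3·A, C = B²); the chain-rule product matches a direct finite-difference slope.

```python
# A -> B -> C with hypothetical functions: B = 3*A, C = B**2.

def B(a):
    return 3 * a

def C(b):
    return b ** 2

def dC_dA(a):
    # (how sensitive C is to B) * (how sensitive B is to A)
    dC_dB = 2 * B(a)   # derivative of b**2 with respect to b
    dB_dA = 3          # derivative of 3*a with respect to a
    return dC_dB * dB_dA

# Finite-difference check: nudge A and measure how C moves
a, eps = 2.0, 1e-6
numeric = (C(B(a + eps)) - C(B(a - eps))) / (2 * eps)
print(dC_dA(a), round(numeric, 3))  # both ≈ 36.0
```

The two numbers agree: multiplying the local sensitivities really does give the end-to-end sensitivity.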
How Diffusion Models Power AI Videos: An Incredible Visual Explanation
I first wrapped my head around diffusion models in 2023, thanks to the MIT 6.S191 lecture on 'Deep Learning New Frontiers'. The idea of reverse-denoising just clicked for me—it reminded me of how our brains pick out shapes and objects in clouds or random mosaics. Yesterday my 3Blue1Brown subscription surfaced 'But how do AI videos actually work?', a guest video by @WelchLabsVideo. That video...
Embedding
“Embeddings” emphasizes the notion of representing data in a meaningful and structured way, while “[[Vectors]]” refers to the numerical representation itself. ‘Vector embeddings’ are a way to represent different data types (like words, sentences, and articles) as points in a multidimensional space. OpenAI’s vector embedding model is called text-embedding-ada-002 (read their Dec 2022 post announcing it). There...
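A tiny sketch of "points in a multidimensional space": the 3-d vectors below are made up for illustration (real models like text-embedding-ada-002 produce vectors with thousands of dimensions), but they show how closeness between points can stand in for closeness in meaning.

```python
import math

def cosine_similarity(u, v):
    # angle-based closeness: 1.0 = same direction, 0.0 = unrelated
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical embeddings -- the numbers are invented for this sketch
cat   = [0.9, 0.8, 0.1]
dog   = [0.8, 0.9, 0.2]
stock = [0.1, 0.2, 0.9]

print(cosine_similarity(cat, dog))    # high: related concepts sit close together
print(cosine_similarity(cat, stock))  # low: unrelated concepts sit far apart
```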
Three Years of Learning AI: Resources That Shaped My Intuition
This weekend I finally finished reading Why Machines Learn: The Elegant Math Behind AI (by Anil Ananthaswamy). It took me seven months—an unusually long time for a 500-page book. But the detour was worth it: the book kept sending me down side paths, like brushing up on linear algebra and derivatives—topics I hadn’t revisited in nearly three decades. Now that I'm done, the book feels like a...
Dot Products
Conceptually, for two vectors x and y, x.y is defined as the magnitude of x multiplied by the projection of y onto x (think of the projection as the shadow cast by y onto x). If x and y are at right angles (orthogonal), x.y will be zero, regardless of the length of either of them. Complex stuff: the set of weights in a neuron is nothing but a vector (w1, w2, ..). That weight vector is orthogonal to the line that is...
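The orthogonality claim can be checked with concrete numbers. The weights below are made up for illustration: with w = (2, 1) and zero bias, the neuron's boundary is the line 2·x1 + 1·x2 = 0, and the dot product of w with any direction along that line is zero.

```python
# Sketch: a neuron's weight vector is orthogonal to its decision boundary.
w = (2.0, 1.0)  # hypothetical weights; boundary is 2*x1 + 1*x2 = 0

# Any step along the boundary is a multiple of (1, -2), since 2*1 + 1*(-2) = 0
direction_along_boundary = (1.0, -2.0)

dot = w[0] * direction_along_boundary[0] + w[1] * direction_along_boundary[1]
print(dot)  # 0.0 -> w is at a right angle to the boundary
```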
Vectors
Vectors have two properties: 1. Magnitude (length), 2. Direction. From a computer science perspective, a vector is just an ordered list of numbers. Vectors can be added and multiplied by a number (= scaled). A key operation on vectors is the dot product. Conceptually the dot product a.b is defined as the magnitude of vector a multiplied by the projection of vector b onto a. Projection can be thought of as the “shadow...
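The "magnitude times shadow" definition is easy to verify with toy numbers. The vectors below are chosen so the shadow is obvious: a lies along the x-axis, so b's shadow on a is just b's x-coordinate.

```python
import math

# Sketch: dot product a.b = |a| * (projection of b onto a). Toy vectors.
a = (3.0, 0.0)   # points along the x-axis, length 3
b = (2.0, 2.0)   # its "shadow" on the x-axis has length 2

dot = sum(ai * bi for ai, bi in zip(a, b))    # componentwise: 3*2 + 0*2 = 6
mag_a = math.sqrt(sum(ai * ai for ai in a))   # |a| = 3
projection_of_b_onto_a = dot / mag_a          # the shadow's length, 2

print(dot, mag_a * projection_of_b_onto_a)    # both 6.0: the definitions agree
```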
Layers
Layers = groups of perceptrons (see [[Perceptron]]) stacked so that each layer’s outputs become the next layer’s inputs, letting the network learn increasingly abstract features. Stacked/layered perceptrons create a feed-forward network (i.e. signals go from the input layer to the output layer, never backwards). The layers between input and output are called hidden layers because they are the intermediate...
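"Each layer's outputs become the next layer's inputs" can be sketched as a tiny forward pass. All the weights below are invented for illustration; a layer is just a list of (weights, bias) pairs.

```python
# Minimal feed-forward sketch: input -> hidden layer -> output layer.
def neuron(x, w, b):
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return max(0.0, z)  # ReLU activation

def layer(x, neurons):
    return [neuron(x, w, b) for (w, b) in neurons]

# Hypothetical weights: 2 inputs -> 2 hidden units -> 1 output
hidden = [([1.0, -1.0], 0.0), ([0.5, 0.5], 0.1)]
output = [([1.0, 1.0], -0.2)]

x = [2.0, 1.0]
h = layer(x, hidden)   # the hidden layer's outputs...
y = layer(h, output)   # ...are fed forward as the output layer's inputs
print(h, y)
```

The signal only ever flows forward: x produces h, and h produces y.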
Perceptron
Perceptron = the simplest artificial neuron: it takes multiple inputs, multiplies them by weights, adds a bias, and turns the result into a yes/no (or score) output. Inputs: a vector of features (x1, x2, ..., xn). Parameters: one weight per input (w1, w2, ..., wn) and an overall bias (b), sometimes written as w0 paired with a constant input. Computation: a weighted sum, then a non-linearity (activation function) that...
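The whole definition fits in a few lines. This is a minimal sketch with a step activation; the weights implementing logical AND are a classic choice, picked here for illustration.

```python
# Minimal perceptron: weighted sum plus bias, then a step activation.
def perceptron(x, w, b):
    z = sum(wi * xi for wi, xi in zip(w, x)) + b  # weighted sum + bias
    return 1 if z > 0 else 0                      # yes/no output

# Hypothetical weights implementing logical AND over binary inputs:
# z is positive only when both inputs are 1 (1 + 1 - 1.5 = 0.5 > 0)
w, b = [1.0, 1.0], -1.5
print(perceptron([1, 1], w, b))  # 1
print(perceptron([1, 0], w, b))  # 0
print(perceptron([0, 0], w, b))  # 0
```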