Large Language Models, Artificial Intelligence and Data Science
Syllabus: CS211 Section 301 Spring 2026
Info
3 Credits
Tue 2:10pm - 5:00pm
Room C409
Instructor Information
calvin_williamson@fitnyc.edu
office: B831 Science and Math
office hours: M 1-3, T 12-1, R 12-1
Description
This course provides an introduction to large language models and their capabilities in artificial intelligence and data science. Through theory and hands-on labs, students will gain literacy in model architecture, training techniques, applications in programming, content creation, and more. No prior programming experience is required to take this course. Prerequisite(s): Math Proficiency
Outcomes
- Explain the evolution, capabilities, and limitations of large language models.
- Analyze the core components of LLMs including model architecture, parameters, and training techniques.
- Assess strategies like prompt engineering and fine-tuning to optimize LLM performance.
- Explore mathematical foundations of vector databases and storage including semantic similarity
- Apply LLMs to natural language tasks like classification, summarization, and question answering.
- Utilize LLMs to assist with programming, data science, content creation, and other domains.
- Examine ethical implications of LLMs including bias, misinformation, and legal issues.
Course Materials
We will be using Google Colab, OpenAI ChatGPT, Anthropic Claude, Google Gemini, and other LLM tools for all work in this course. Since these are web-based there is NO OTHER SOFTWARE required for the course besides a web browser.
Topics
Introduction to Large Language Models (LLMs)
- LLM Examples (ChatGPT, Gemini, Claude, DeepSeek)
- Applications: NotebookLM
- Multimodal Model Examples (Gemini 2.5)
Introduction to Python for Artificial Intelligence
- Google Colab Notebook
- Using LLM as Coding Assistant
- Calculations
- Variables
- DataTypes
- Lists
- Dictionaries
- Functions
- Dataframes
- f-Strings
Prompt Engineering Techniques
- Completions, APIs
- System Prompts
- Context Size
- Prompt Structure
- Few-shot and Zero-shot Learning
- Structured Outputs
- Prompt Chaining
- Roles and Personas
- Chain of Thought
- Reasoning Models
- Chat Conversations, Memory
- Multimodal Prompt Techniques
LLM Applications
- Classification
- Artifacts
- Sentiment Analysis
- Question Answering
- Text Summarization
- Named Entity Recognition (NER)
Image Generation Models
- Prompting for Image Generation
- FLUX model
Tools, Agents
- Tools and APIs
- Model Context Protocol (MCPS)
Agentic Software Tools
- Artifacts
- Visual Studio Code (Cline)
- CLI(Command Line) Tools
LLM and Data Science
- Databases And AI-Assisted SQL
- Data Science Agents
Evaluation
Your grade will come from these parts:
- Quizzes (85%)
- In Class/Homework Credits (15%)
Each of these parts is described in more detail below
Quizzes
Your quiz grade will come from 5 quizzes roughly covering 2 or 3 weeks material eachThis quizzes are 30-45 minutes each and are usually 5 or 6 questions each.These quizzes are with no notes, no internet, no phone, no software, no AI tools.Pen and paper and calculator only. They are some multiple choice, some short answer, some true false.
In Class/Homework (1 or 2 per class)
These are credits you obtain for demonstrating you have completed assigned problems. Some of these will come from homework assignments that you show me at the beginning of the class, some of these will come from in class assignments that are done during class and you show as you complete them. You will earn a credit for each successful assignment completion. You must be in attendance to earn these problem credits.
There is NO FINAL EXAM.
AI Policy
All uses of chatbots are encouraged, and there is no restriction on their use. This is especially for topics about large language models (ChatGPT, Gemini, Claude, etc).