← All Webinars | L.A.B.S. #8
AI in Practice: Part 3 | Prompt Engineering Mastery & Model Comparison
Master the art of prompt engineering and gain proficiency in comparing models effectively.
Level: Beginner🐣
Equip yourself with practical tools, strategies, and real-world examples to boost your AI skills and effectiveness.
+
Watch the highlights: https://blog.testsys.com/2024/09/05/ai-in-practice-your-quick-guide-to-practical-ai-use/
+
Interested in partnering on a webinar? Share your ideas at webinars@testsys.com.
1
00:00:15.285 --> 00:00:15.505
Hi.
2
00:00:15.505 --> 00:00:18.385
Welcome everyone. We'll get started in just a few minutes
3
00:00:18.535 --> 00:00:21.585
when I give people time to join us and log in.
4
00:00:23.705 --> 00:00:27.515
Welcome, Andrea. Hi, Amanda. Lovely to see you. You too.
5
00:00:32.535 --> 00:00:34.025
Welcome. Welcome everyone.
6
00:00:40.165 --> 00:00:41.505
I'm gonna give everyone a few minutes
7
00:00:41.565 --> 00:00:43.065
to join coming in from lunch.
8
00:01:06.605 --> 00:01:08.745
We flowing in here. Welcome everyone.
9
00:01:09.015 --> 00:01:10.665
Just a few minutes before we get started.
10
00:01:32.865 --> 00:01:35.565
One more moment and then we'll kick things off.
11
00:01:45.445 --> 00:01:47.445
A lot of familiar faces in the chat.
12
00:01:47.445 --> 00:01:48.805
Thanks for joining us today.
13
00:01:53.675 --> 00:01:55.255
All right, well we are at two
14
00:01:55.255 --> 00:01:56.495
after, so we're gonna go ahead and get started.
15
00:01:56.995 --> 00:01:59.135
Um, thank you everyone for joining us today.
16
00:01:59.235 --> 00:02:01.815
My name is Amanda Crowley. I'm the director of marketing.
17
00:02:01.885 --> 00:02:04.415
I'll be your host and moderator today for this session.
18
00:02:05.235 --> 00:02:07.175
Um, so thank you again for joining us.
19
00:02:07.175 --> 00:02:09.535
Many of you are returning, which I appreciate.
20
00:02:09.795 --> 00:02:12.575
Uh, if you are new here, welcome to ITS Summer Demo Days
21
00:02:13.195 --> 00:02:14.255
AI and Practice series.
22
00:02:14.285 --> 00:02:15.375
This is part three.
23
00:02:15.715 --> 00:02:17.495
Before we get started with Andrea,
24
00:02:17.695 --> 00:02:20.215
I just wanna give you some housekeeping things to go over.
25
00:02:20.985 --> 00:02:22.455
First, we're gonna be using the q
26
00:02:22.455 --> 00:02:24.655
and a that's at the bottom of your Zoom.
27
00:02:25.075 --> 00:02:27.695
Uh, if you have a question, go ahead and put it in there
28
00:02:27.695 --> 00:02:28.975
and we'll answer it live time.
29
00:02:29.395 --> 00:02:32.255
If we happen to get too many, um, that's okay.
30
00:02:32.255 --> 00:02:33.455
We'll answer them after the webinar.
31
00:02:33.715 --> 00:02:35.655
The webinar itself is 45 minutes long,
32
00:02:36.155 --> 00:02:38.295
so this session will be recorded.
33
00:02:38.355 --> 00:02:39.655
If you can't stay the whole time
34
00:02:39.795 --> 00:02:41.895
or you wanna revisit it, we will go ahead
35
00:02:41.895 --> 00:02:43.015
and send it out via email.
36
00:02:43.635 --> 00:02:45.975
And then lastly, when we are done today,
37
00:02:45.995 --> 00:02:47.135
we will send you a survey.
38
00:02:47.515 --> 00:02:49.535
It comes up right when you click out of Zoom.
39
00:02:49.715 --> 00:02:51.415
If you could take a second to answer it,
40
00:02:51.415 --> 00:02:52.695
it's about five questions long.
41
00:02:53.515 --> 00:02:56.135
So thank you. Thank you again for being here.
42
00:02:56.555 --> 00:02:57.815
Um, we are with Andrea.
43
00:02:57.915 --> 00:03:00.525
She is gonna be talking about prompt engineering today,
44
00:03:00.525 --> 00:03:02.405
which is such a fun topic.
45
00:03:02.665 --> 00:03:04.565
Uh, she is our senior product manager
46
00:03:05.105 --> 00:03:07.245
and we also wanted to give a special sh shout
47
00:03:07.245 --> 00:03:08.485
out to Emily Bank.
48
00:03:08.575 --> 00:03:11.765
She's our technical writer who did a lot of, um, time
49
00:03:11.825 --> 00:03:13.965
and effort to help us prepare for this session today.
50
00:03:14.425 --> 00:03:16.525
So with that, I'll turn it over to you, Andrea.
51
00:03:16.935 --> 00:03:19.365
Thank you Amanda. So a little bit about me.
52
00:03:19.485 --> 00:03:21.605
I joined ITS about seven years ago
53
00:03:21.865 --> 00:03:24.445
and I'm on the product management team for item workshop.
54
00:03:24.445 --> 00:03:26.085
That's our item banking platform.
55
00:03:26.705 --> 00:03:28.205
And at the start of this year,
56
00:03:28.425 --> 00:03:31.205
my team rolled out our first Spark AI feature in item
57
00:03:31.485 --> 00:03:34.125
workshop, which was automatic item generation using
58
00:03:34.125 --> 00:03:35.205
generative ai.
59
00:03:36.025 --> 00:03:39.005
And my team's currently busy developing our next round
60
00:03:39.005 --> 00:03:40.805
of features that are gonna harness the power
61
00:03:40.865 --> 00:03:41.925
of generative AI
62
00:03:41.985 --> 00:03:44.805
and machine learning to support analysis tasks
63
00:03:44.905 --> 00:03:46.925
and content generation and item workshop.
64
00:03:47.785 --> 00:03:50.365
But when we started, my team had to jump in
65
00:03:50.425 --> 00:03:53.245
and learn about generative AI and how to write prompts.
66
00:03:53.385 --> 00:03:55.565
So I'm really excited to share with you
67
00:03:55.565 --> 00:03:56.885
what we've learned today.
68
00:03:58.315 --> 00:04:00.765
Wonderful. So to get started, Andrea,
69
00:04:00.835 --> 00:04:02.645
what is a model? What is a provider?
70
00:04:03.455 --> 00:04:05.165
Let's start with artificial intelligence.
71
00:04:05.165 --> 00:04:07.725
Let's go back to the beginning. So what is ai?
72
00:04:08.225 --> 00:04:11.765
So AI is just a set of technologies that enables computers
73
00:04:11.785 --> 00:04:13.845
to emulate human intelligence
74
00:04:13.985 --> 00:04:16.005
and perform tasks that require learning
75
00:04:16.185 --> 00:04:17.845
and reasoning and problem solving.
76
00:04:18.465 --> 00:04:21.485
So then generative AI is just a type of AI
77
00:04:21.675 --> 00:04:23.765
that uses large language models
78
00:04:24.065 --> 00:04:25.645
to generate and analyze content.
79
00:04:26.785 --> 00:04:28.605
So what is a large language model?
80
00:04:29.155 --> 00:04:30.525
Well, a large language model
81
00:04:30.825 --> 00:04:34.445
or an LLM, it's just a generative AI program that's capable
82
00:04:34.625 --> 00:04:37.365
of understanding, analyzing text and images.
83
00:04:38.145 --> 00:04:41.205
And these models, they're trained on large sources of data
84
00:04:41.625 --> 00:04:44.045
and that data is often scraped from the internet.
85
00:04:44.305 --> 00:04:48.645
So you may already be familiar with an LLM, um, for example,
86
00:04:49.565 --> 00:04:53.485
GPT, you may have used chat GBT, well GBT four,
87
00:04:53.625 --> 00:04:54.925
that's an example of a model.
88
00:04:55.835 --> 00:05:00.585
Yeah. Okay. Um, and what about providers?
89
00:05:01.445 --> 00:05:04.465
So the companies that own these models, they're known
90
00:05:04.465 --> 00:05:05.585
as AI providers.
91
00:05:05.845 --> 00:05:10.105
So open AI is the AI provider behind G BT four,
92
00:05:10.725 --> 00:05:14.385
and then Google is the AI provider behind the Model Gemini.
93
00:05:14.845 --> 00:05:17.745
So we're gonna see a list of providers today in the demo.
94
00:05:18.815 --> 00:05:20.025
Okay. All right, got it.
95
00:05:20.405 --> 00:05:23.225
So when you log into a provider
96
00:05:23.365 --> 00:05:27.505
or a model, you begin prompting or or talking to that model.
97
00:05:27.775 --> 00:05:29.745
What does prompt engineering exactly?
98
00:05:30.815 --> 00:05:33.025
Yeah, so prompt engineering is a lot of fun,
99
00:05:33.365 --> 00:05:37.185
but I wanna first start by remembering back to late 2022
100
00:05:37.695 --> 00:05:40.225
when OpenAI launched chat, GBT
101
00:05:40.835 --> 00:05:44.345
generative AI pretty much overnight put the world a buzz
102
00:05:45.485 --> 00:05:47.225
and OpenAI already had other
103
00:05:47.225 --> 00:05:49.065
models, they already had them out.
104
00:05:49.065 --> 00:05:50.265
But what was so brilliant
105
00:05:50.485 --> 00:05:55.265
and addictive about chat GBT was that every man, all of us,
106
00:05:55.725 --> 00:05:57.845
we suddenly could chat with ai.
107
00:05:58.265 --> 00:06:00.325
And it felt like this very democratizing moment
108
00:06:00.995 --> 00:06:05.045
because I'm not a developer and I'm not a data modeler,
109
00:06:05.185 --> 00:06:06.365
but I chat
110
00:06:06.365 --> 00:06:08.245
with my colleagues on the Microsoft
111
00:06:08.245 --> 00:06:09.485
teams throughout my workday.
112
00:06:10.025 --> 00:06:11.805
We all know how to chat and now I could chat
113
00:06:11.805 --> 00:06:13.045
with that AI model.
114
00:06:13.665 --> 00:06:17.085
So essentially prompt engineering is just that.
115
00:06:17.395 --> 00:06:20.685
It's simply chatting with your new colleague, the LLM.
116
00:06:21.545 --> 00:06:24.565
So the prompt itself is just a request, it's just your text
117
00:06:24.565 --> 00:06:26.085
that you send to the generative model
118
00:06:26.465 --> 00:06:28.325
and then the generative model is going
119
00:06:28.325 --> 00:06:30.125
to send you back a response.
120
00:06:31.225 --> 00:06:34.125
So you may be a prompt engineering newbie, um,
121
00:06:34.245 --> 00:06:37.765
coming into this webinar, but the skills you use every day
122
00:06:37.865 --> 00:06:39.805
to write your text, write your emails,
123
00:06:40.065 --> 00:06:41.845
and to help train up your teammates,
124
00:06:42.195 --> 00:06:44.925
they're gonna make you a fantastic prompt engineer.
125
00:06:46.075 --> 00:06:49.165
That AI model is just your highly logical teammate
126
00:06:49.165 --> 00:06:52.165
who needs your support and your coaching to be successful.
127
00:06:53.385 --> 00:06:55.515
Yeah, so I was sharing with Andrea
128
00:06:55.615 --> 00:06:57.235
before the session got started.
129
00:06:57.695 --> 00:06:59.475
She was the reason I really started playing
130
00:06:59.505 --> 00:07:02.995
with tools like Chachi, bt I was so intimidated.
131
00:07:03.215 --> 00:07:05.995
I'm not, you know, a computer engineer or anything.
132
00:07:06.135 --> 00:07:08.395
And Andrea said, oh no, like this is built
133
00:07:08.395 --> 00:07:10.315
for people like you that I'm not.
134
00:07:10.535 --> 00:07:13.395
Um, well are technically challenged for better words
135
00:07:13.975 --> 00:07:15.235
and go in and try it.
136
00:07:15.495 --> 00:07:17.155
And so when I first started trying it,
137
00:07:17.315 --> 00:07:20.995
I was getting responses that were just not useful for me.
138
00:07:21.455 --> 00:07:23.675
So is there anything that you can,
139
00:07:23.675 --> 00:07:25.915
advice you can give us about improving the quality
140
00:07:25.915 --> 00:07:27.995
of responses from the AI model?
141
00:07:28.865 --> 00:07:31.915
Yeah, I'm gonna give you five tips today. Okay.
142
00:07:31.975 --> 00:07:33.315
Um, so we're gonna go through five tips
143
00:07:33.315 --> 00:07:35.075
to keep in mind when writing those prompts.
144
00:07:35.415 --> 00:07:36.955
And Amanda, if there's time at the end,
145
00:07:37.075 --> 00:07:39.155
I have a couple bonus tips that I'd love to share.
146
00:07:39.545 --> 00:07:41.355
Sure. So there's a couple things
147
00:07:41.495 --> 00:07:42.715
I'd like to share up front though.
148
00:07:43.335 --> 00:07:46.355
So the first is don't give up if you don't get
149
00:07:46.435 --> 00:07:47.475
a good response initially.
150
00:07:48.465 --> 00:07:51.275
When you send an email to your colleague, um, trying
151
00:07:51.275 --> 00:07:52.595
to elicit information from them
152
00:07:52.895 --> 00:07:55.315
and they, their response fails to answer your questions,
153
00:07:55.655 --> 00:07:57.875
you don't give up, you follow up,
154
00:07:57.935 --> 00:08:00.475
you send an additional email, giving them some more details
155
00:08:01.055 --> 00:08:02.795
and making sure your questions are clear
156
00:08:02.815 --> 00:08:04.715
and concise so that they can respond to you.
157
00:08:05.615 --> 00:08:07.715
Do the same thing with the generative ai.
158
00:08:07.975 --> 00:08:09.955
If you don't get what you're looking for the first time,
159
00:08:10.605 --> 00:08:12.115
apply some of the tips from today
160
00:08:12.295 --> 00:08:14.355
and keep trying, keep trying again.
161
00:08:15.295 --> 00:08:17.475
The second thing I'd like to add up front is
162
00:08:17.475 --> 00:08:20.715
that the quality of the responses can differ across models.
163
00:08:21.575 --> 00:08:24.915
So for example, GBT four Omni, which was
164
00:08:25.455 --> 00:08:29.195
the most recently released, um, model from OpenAI,
165
00:08:29.385 --> 00:08:31.955
it's faster, it's smarter, it's bigger,
166
00:08:32.065 --> 00:08:34.355
it's better than the earlier GPT models like
167
00:08:34.355 --> 00:08:35.635
Da Vinci and GPT-3.
168
00:08:36.335 --> 00:08:37.835
So the models are changing
169
00:08:38.295 --> 00:08:40.355
and they do have different, different abilities
170
00:08:41.095 --> 00:08:43.835
and you're gonna wanna try several models out to find
171
00:08:43.835 --> 00:08:45.435
what works best for your use case.
172
00:08:46.055 --> 00:08:47.715
Um, different models have different strengths.
173
00:08:47.815 --> 00:08:50.715
Amanda, if time allows at the end of this webinar, I'd love
174
00:08:50.715 --> 00:08:52.995
to cover some of the basics of model differences.
175
00:08:57.255 --> 00:08:59.265
Okay. Yeah, we can definitely do that.
176
00:08:59.485 --> 00:09:03.865
Um, at ITS, we have a Spark AI playground,
177
00:09:04.245 --> 00:09:05.665
so it's something that some
178
00:09:05.665 --> 00:09:07.425
of our internal teams have been using.
179
00:09:07.725 --> 00:09:09.105
Can you tell me a little bit about that?
180
00:09:09.885 --> 00:09:11.025
I'd love to. So we're actually
181
00:09:11.025 --> 00:09:12.065
gonna use the playground today.
182
00:09:12.575 --> 00:09:15.145
Okay. Um, and the Spark AI playground,
183
00:09:15.485 --> 00:09:17.665
it was originally created as an internal tool
184
00:09:17.665 --> 00:09:18.825
for ITS employees
185
00:09:19.205 --> 00:09:20.665
and it was created by our director
186
00:09:20.725 --> 00:09:24.505
of Innovative Technologies, Chris CLN and our IDC team.
187
00:09:25.125 --> 00:09:27.305
And this was really an internal tool initially
188
00:09:27.405 --> 00:09:30.545
and it was to help ITS employees learn about prompt
189
00:09:30.865 --> 00:09:32.625
engineering and to try out their ideas.
190
00:09:33.415 --> 00:09:36.225
It's been incredibly useful tools to all of us internally
191
00:09:36.925 --> 00:09:39.225
and here at ITS we're dedicated
192
00:09:39.485 --> 00:09:41.305
to innovating with our clients.
193
00:09:41.845 --> 00:09:44.345
So we decided to extend access
194
00:09:44.365 --> 00:09:46.665
to our Spark AI playground to our clients.
195
00:09:47.445 --> 00:09:49.345
So I'm gonna share my screen in just a moment
196
00:09:49.365 --> 00:09:51.025
and I'll give you guys a tour of the playground.
197
00:09:52.025 --> 00:09:52.725
Sounds great.
198
00:10:00.315 --> 00:10:01.695
Amanda, can you see my screen?
199
00:10:02.155 --> 00:10:05.295
Yep, I sure can. Great. Okay.
200
00:10:05.915 --> 00:10:08.455
So the first thing I'd like to point out in the Spark AI
201
00:10:08.455 --> 00:10:10.735
playground is that you can use it to generate text
202
00:10:11.115 --> 00:10:12.255
or to generate images.
203
00:10:12.745 --> 00:10:14.615
Today we're gonna be generating text.
204
00:10:15.525 --> 00:10:17.295
Next thing I'd like to point out is this
205
00:10:17.355 --> 00:10:18.775
prompt text box up here.
206
00:10:18.775 --> 00:10:21.015
This is where we're going to be entering our prompts today.
207
00:10:21.635 --> 00:10:23.335
And once we're done entering our prompts,
208
00:10:23.335 --> 00:10:24.775
we're gonna click the submit button.
209
00:10:25.725 --> 00:10:28.815
This response area is where the AI model is going
210
00:10:28.815 --> 00:10:30.575
to share their responses back to me.
211
00:10:31.435 --> 00:10:33.655
So for the first model we're gonna look at today,
212
00:10:33.915 --> 00:10:37.045
I'm gonna select OpenAI
213
00:10:38.955 --> 00:10:43.705
and I'm gonna select that new, okay,
214
00:10:44.055 --> 00:10:45.105
technical difficulty.
215
00:10:45.245 --> 00:10:46.785
I'm gonna have to sign out, I'm gonna have
216
00:10:46.785 --> 00:10:50.765
to refresh my page and
217
00:10:50.765 --> 00:10:52.005
potentially sign back in.
218
00:11:05.845 --> 00:11:09.035
There we go. I think I just had had this waiting a little
219
00:11:09.035 --> 00:11:11.715
too long on my screen that I timed out.
220
00:11:12.575 --> 00:11:16.035
So I'm gonna go ahead and select OpenAI as my provider.
221
00:11:16.175 --> 00:11:18.475
And then from the model list of OpenAI models,
222
00:11:18.695 --> 00:11:20.755
I'm gonna select GPT-4 Omni.
223
00:11:21.495 --> 00:11:25.715
Now when I send my re response GPT-4, I'm send my prompt.
224
00:11:26.035 --> 00:11:27.795
GPT-4 is going to share its response
225
00:11:27.915 --> 00:11:29.235
with me in this text box here.
226
00:11:29.535 --> 00:11:32.795
But I wanna compare the performance of G PT four
227
00:11:32.795 --> 00:11:35.195
to another model to see how they respond
228
00:11:35.195 --> 00:11:36.395
to my, my single prompt.
229
00:11:36.855 --> 00:11:38.795
So I'm gonna click add compare here,
230
00:11:39.575 --> 00:11:42.035
and in this response section I'm gonna choose another model.
231
00:11:42.565 --> 00:11:44.235
Let's go with Google Gemini.
232
00:11:46.125 --> 00:11:48.465
And Google Gemini is gonna send its response
233
00:11:48.465 --> 00:11:50.505
to me in this text box here at the bottom.
234
00:11:50.925 --> 00:11:53.965
Now I could start adding additional models as many
235
00:11:53.965 --> 00:11:55.645
as I want across all 10
236
00:11:55.705 --> 00:11:58.245
or 11 of those providers that you saw on the provider list
237
00:11:58.585 --> 00:12:00.685
to compare how they all respond to my prompts.
238
00:12:00.905 --> 00:12:02.605
But I'm gonna keep it pretty simple right now
239
00:12:02.605 --> 00:12:03.965
for the purposes of this demo.
240
00:12:05.065 --> 00:12:07.045
So let's start with prompt number one.
241
00:12:07.345 --> 00:12:09.365
I'm gonna copy and paste it into this prompt
242
00:12:09.595 --> 00:12:10.605
text box right here.
243
00:12:18.675 --> 00:12:22.565
Okay. So my first tip is to assign
244
00:12:23.495 --> 00:12:24.935
a persona to the ai.
245
00:12:25.915 --> 00:12:28.535
So when I'm added to a new work project, one
246
00:12:28.535 --> 00:12:31.015
of my first questions is, what is my role on this team?
247
00:12:31.795 --> 00:12:35.655
And the answer to that question changes the work I do
248
00:12:35.955 --> 00:12:37.375
and what I share back with the team.
249
00:12:38.035 --> 00:12:41.175
So for example, if a colleague emails me a draft
250
00:12:41.355 --> 00:12:45.055
of an ITS blog post, I'm gonna wanna know is my role
251
00:12:45.155 --> 00:12:47.455
as editor, am I looking for spelling and grammar?
252
00:12:48.035 --> 00:12:50.815
Or is my role a writer role where they're expecting me
253
00:12:50.815 --> 00:12:53.295
to add content and rewrite this blog post?
254
00:12:53.955 --> 00:12:58.295
So treat the spark ai, uh, treat the AI model the same way.
255
00:12:58.365 --> 00:13:00.375
Tell it what you're expecting, what is its role.
256
00:13:01.075 --> 00:13:03.575
So our first tip is add your persona.
257
00:13:03.575 --> 00:13:05.895
And in this case I'm asking the AI
258
00:13:05.895 --> 00:13:07.215
to be a member of our support team.
259
00:13:07.695 --> 00:13:12.055
'cause my goal is to get the AI to share back
260
00:13:12.075 --> 00:13:14.775
and report to me on our customer feedback.
261
00:13:15.075 --> 00:13:16.215
So this is a really simple,
262
00:13:16.855 --> 00:13:18.735
a simple prompt other than the persona.
263
00:13:18.755 --> 00:13:21.815
And I have my list of customer feedback
264
00:13:21.815 --> 00:13:23.015
here in this bulleted list.
265
00:13:23.395 --> 00:13:26.855
So we're gonna continue building on this prompt as we move
266
00:13:26.855 --> 00:13:27.855
through the webinar tips.
267
00:13:28.835 --> 00:13:31.175
But let's get started by clicking the submit button
268
00:13:31.395 --> 00:13:33.575
and see how the two models do.
269
00:13:34.355 --> 00:13:37.335
So right now what we're doing is the Spark AI playground is
270
00:13:37.335 --> 00:13:39.255
setting an API call to these models.
271
00:13:39.915 --> 00:13:43.335
Now the prompt that I entered will not be shared, um,
272
00:13:43.445 --> 00:13:45.335
back into their, their training data set.
273
00:13:45.755 --> 00:13:47.815
So there's no concern over that.
274
00:13:48.715 --> 00:13:52.255
And you can see Gemini was a little faster than GBT.
275
00:13:52.395 --> 00:13:54.135
So let's take a look at these responses.
276
00:13:55.555 --> 00:13:57.855
So we can see we got a pretty basic report.
277
00:13:58.515 --> 00:14:01.495
Um, it's almost fitting back to me exactly what I had from
278
00:14:01.495 --> 00:14:04.455
that list and I feel like we could do better.
279
00:14:04.865 --> 00:14:06.415
Let's see how Gemini did.
280
00:14:07.775 --> 00:14:11.055
I am getting a report with common issues and some feedback,
281
00:14:11.515 --> 00:14:13.215
but I wanna spice this up a little.
282
00:14:13.375 --> 00:14:14.815
I wanna get a more thorough report.
283
00:14:15.235 --> 00:14:17.655
So let's move on to prompt number two.
284
00:14:24.575 --> 00:14:28.035
I'm just gonna copy and paste it here in my prompt text box.
285
00:14:30.575 --> 00:14:34.635
So my second tip is to tell the model your spec, your
286
00:14:35.195 --> 00:14:36.355
specific expectations.
287
00:14:37.575 --> 00:14:41.075
For example, I may wanna specify that the report be pro uh,
288
00:14:41.105 --> 00:14:42.675
formatted in a table
289
00:14:42.975 --> 00:14:45.755
or have a specified number of paragraphs or word count.
290
00:14:46.495 --> 00:14:48.355
So I like to think about the instructions
291
00:14:48.355 --> 00:14:50.715
that I give my real world teammates when I
292
00:14:50.715 --> 00:14:51.805
hand off a work task.
293
00:14:52.265 --> 00:14:53.685
So I work with a technical writer
294
00:14:54.385 --> 00:14:56.205
and when she starts on a new document,
295
00:14:56.505 --> 00:14:57.685
we have a conversation
296
00:14:58.185 --> 00:15:01.365
and we talk about what the tone should be for that document.
297
00:15:02.905 --> 00:15:06.045
We also talk about how long we want that document to be.
298
00:15:07.065 --> 00:15:10.045
And then for complex documents, we often talk about
299
00:15:10.185 --> 00:15:11.885
how we should organize the information
300
00:15:12.385 --> 00:15:13.765
and section that document.
301
00:15:14.385 --> 00:15:17.445
And I like to do that as well with, with my AI model.
302
00:15:18.225 --> 00:15:21.925
So now that we gave some ex explicit expectations
303
00:15:21.945 --> 00:15:23.885
to the model, let's run this prompt
304
00:15:24.145 --> 00:15:25.685
and see how our two models do.
305
00:15:32.855 --> 00:15:36.825
Okay, again, it looks like Google Gemini won the race.
306
00:15:37.205 --> 00:15:38.225
So let's take a look.
307
00:15:39.085 --> 00:15:41.865
So I can see that I got the table that I asked for
308
00:15:41.865 --> 00:15:42.905
to format in tables.
309
00:15:43.505 --> 00:15:45.265
I am gonna point out that these numbers look
310
00:15:45.495 --> 00:15:46.785
supe suspicious to me.
311
00:15:47.345 --> 00:15:48.865
I would double check them
312
00:15:49.055 --> 00:15:51.505
because this to me looks like a hallucination.
313
00:15:51.845 --> 00:15:53.665
So it looks like Google Gemini is doing
314
00:15:53.665 --> 00:15:54.825
some hallucinating here.
315
00:15:55.805 --> 00:15:57.945
We can see I got the sections that I asked for
316
00:15:59.045 --> 00:16:02.145
and I also asked it to supply a proposed action item.
317
00:16:04.145 --> 00:16:08.255
Let's see how GBT four did also getting my sections.
318
00:16:08.255 --> 00:16:09.895
This is a better formatted report.
319
00:16:10.655 --> 00:16:13.255
I have my table, my frequencies knowing
320
00:16:13.275 --> 00:16:14.415
the data that I sent in.
321
00:16:14.415 --> 00:16:15.535
These look correct to me.
322
00:16:18.875 --> 00:16:20.975
And I also get, I asked for an action item
323
00:16:21.035 --> 00:16:22.175
and it's responding with that.
324
00:16:22.195 --> 00:16:23.895
So you can see this report is already getting a better
325
00:16:24.215 --> 00:16:25.455
structure, a better foundation,
326
00:16:25.555 --> 00:16:27.695
and better returns than my first prompt,
327
00:16:28.475 --> 00:16:30.215
but we can make it even better.
328
00:16:31.195 --> 00:16:32.975
So let's go on to prompt number three.
329
00:16:39.905 --> 00:16:41.835
Okay, this is actually my favorite tip.
330
00:16:45.385 --> 00:16:46.405
So prompt number three.
331
00:16:46.785 --> 00:16:47.805
Uh, my favorite tip is
332
00:16:47.805 --> 00:16:50.485
to organize your prompt into sections.
333
00:16:51.505 --> 00:16:55.445
So my colleagues at ITS know that I often add sections
334
00:16:55.445 --> 00:16:57.165
to my emails and it's
335
00:16:57.165 --> 00:17:00.525
to break up my emails into like logical, logical sections.
336
00:17:00.645 --> 00:17:03.685
I find it's easier for me to organize my idea, my ideas
337
00:17:03.685 --> 00:17:05.605
and make sure I convey everything that I need to convey.
338
00:17:05.985 --> 00:17:08.565
And I like to think it makes it easier on my colleagues
339
00:17:08.585 --> 00:17:10.805
to scan my emails and find what they're looking for.
340
00:17:11.505 --> 00:17:14.605
So I find it works the same way with the, with the AI model.
341
00:17:15.345 --> 00:17:16.605
Now if I'm organized
342
00:17:16.605 --> 00:17:19.485
and I'm giving it all its information, it's more likely to,
343
00:17:19.705 --> 00:17:21.805
to treat that data, um, more equal
344
00:17:21.905 --> 00:17:23.485
and to supply the response I'm looking for.
345
00:17:24.545 --> 00:17:27.765
So I added delimiters and in this case section headers.
346
00:17:27.905 --> 00:17:30.685
So persona, instructions, length structure
347
00:17:30.985 --> 00:17:32.205
to organize my prompt.
348
00:17:32.665 --> 00:17:36.645
But you could also use XML tags or markdown languages
349
00:17:37.305 --> 00:17:39.525
or other, other headers that you see fit.
350
00:17:40.555 --> 00:17:43.175
So I find this helps the generative AI stay on task.
351
00:17:43.585 --> 00:17:44.535
Let's see how they do.
352
00:17:52.305 --> 00:17:55.085
All right, again, Google Gemini won the race.
353
00:17:56.105 --> 00:17:58.925
So this report is getting even more in depth.
354
00:17:59.905 --> 00:18:02.405
So we're getting our introduction section, our summary
355
00:18:02.585 --> 00:18:03.685
of support requests.
356
00:18:04.165 --> 00:18:06.125
I love that it's giving me these frequencies.
357
00:18:06.325 --> 00:18:07.405
I would double check them though,
358
00:18:07.985 --> 00:18:09.405
and descriptions of the issues.
359
00:18:10.545 --> 00:18:13.365
So this is getting enriched, right?
360
00:18:13.965 --> 00:18:17.885
I had also asked in this prompt for it to give me, um, kind
361
00:18:17.885 --> 00:18:19.325
of summarize each section.
362
00:18:19.465 --> 00:18:20.925
And it looks like Gemini did not
363
00:18:20.925 --> 00:18:22.205
respond to that section of the prompt.
364
00:18:22.215 --> 00:18:26.815
Super well, let's see how GPT-4 did.
365
00:18:27.785 --> 00:18:29.095
Again, I have my sections.
366
00:18:30.395 --> 00:18:33.915
I like my frequency counts and it looks like open AI's.
367
00:18:34.235 --> 00:18:37.555
GPT-4 was better at supplying that summary to each section
368
00:18:37.555 --> 00:18:38.795
that I asked it to supply.
369
00:18:39.455 --> 00:18:40.995
So you can see my report is starting
370
00:18:40.995 --> 00:18:43.275
to get more fleshed out and more useful.
371
00:18:43.745 --> 00:18:45.515
This is turning into something I could hand over
372
00:18:45.515 --> 00:18:46.515
to my executive team.
373
00:18:49.325 --> 00:18:50.935
Okay, so prompt number four,
374
00:18:56.455 --> 00:18:58.515
I'm just gonna copy and paste this one in.
375
00:18:59.495 --> 00:19:00.635
And then let's take a look.
376
00:19:03.235 --> 00:19:06.135
So tip number four is to try adding an example
377
00:19:06.315 --> 00:19:07.415
of what you're looking for.
378
00:19:09.115 --> 00:19:13.055
So I tend to use this tip if my earlier prompts are not
379
00:19:13.075 --> 00:19:14.735
as working as well as I would like them to.
380
00:19:14.915 --> 00:19:17.095
I'm just still not getting the format I want.
381
00:19:17.235 --> 00:19:18.615
I'm not getting the tone I want.
382
00:19:18.735 --> 00:19:20.535
I need to give an example to the ai.
383
00:19:21.475 --> 00:19:23.375
So all of our prompts up
384
00:19:23.375 --> 00:19:25.575
to this point have been zero shot prompts,
385
00:19:25.605 --> 00:19:27.295
haven't supplied any examples in them.
386
00:19:28.005 --> 00:19:31.775
This prompt that I just pasted in is a single shot prompt.
387
00:19:32.395 --> 00:19:34.135
So you can see here I have four example
388
00:19:34.755 --> 00:19:36.255
and I actually gave it my example
389
00:19:36.935 --> 00:19:40.215
'cause I wanted to format my action items kind
390
00:19:40.215 --> 00:19:42.415
of in this wording format where it starts
391
00:19:42.415 --> 00:19:43.575
with this action verb.
392
00:19:44.365 --> 00:19:47.695
Okay? So I'm gonna send this, this new prompt
393
00:19:47.725 --> 00:19:49.695
with the example in it over to my models
394
00:19:49.915 --> 00:19:50.935
and see how they perform.
395
00:19:57.625 --> 00:19:59.675
Okay, let's check out on, uh, Gemini.
396
00:20:00.695 --> 00:20:02.075
So again, I get my sections.
397
00:20:02.105 --> 00:20:04.555
This is becoming a really nice fleshed out report
398
00:20:06.405 --> 00:20:08.865
and you can see that my action items are new newly
399
00:20:08.865 --> 00:20:11.105
formatted, using my action verbs to start.
400
00:20:11.415 --> 00:20:13.625
This is closer to what I was hoping for
401
00:20:14.885 --> 00:20:16.945
and it's giving me some interesting action items.
402
00:20:18.395 --> 00:20:20.745
Let's take a look at how GPT-4 did.
403
00:20:25.215 --> 00:20:26.635
So again, I get my sections.
404
00:20:28.275 --> 00:20:29.615
I'm scrolling down my action items.
405
00:20:30.365 --> 00:20:32.775
This was my exact example
406
00:20:33.755 --> 00:20:35.975
and I'm happy that GPT-4
407
00:20:36.575 --> 00:20:38.735
returned my exact example for this reason.
408
00:20:39.355 --> 00:20:41.535
One of the reasons that I tend
409
00:20:41.535 --> 00:20:44.895
to not supply an example in my initial prompts is
410
00:20:44.895 --> 00:20:47.095
because sometimes they will bleed over into
411
00:20:47.265 --> 00:20:48.485
the model's response.
412
00:20:49.305 --> 00:20:51.325
And what I mean by that is it duplicates.
413
00:20:51.385 --> 00:20:54.285
My example in the response, I didn't want GPT-4
414
00:20:54.285 --> 00:20:55.445
to duplicate my example.
415
00:20:55.605 --> 00:20:57.605
I wanted it to gimme some new action items,
416
00:20:57.775 --> 00:21:00.685
insightful action items, um, instead
417
00:21:00.685 --> 00:21:03.285
of just repeating myself to me for verbatim.
418
00:21:04.185 --> 00:21:06.565
So that's a little disappointing from GPT-4,
419
00:21:06.625 --> 00:21:09.085
but you get to see how the two models reacted to
420
00:21:09.085 --> 00:21:10.125
that example differently.
421
00:21:10.865 --> 00:21:13.605
So I tend to use examples for just one-off prompts
422
00:21:13.605 --> 00:21:15.845
that I don't plan to reuse if I use them.
423
00:21:16.185 --> 00:21:18.885
Um, I am weary of putting them in prompts
424
00:21:18.885 --> 00:21:21.365
that get reused repeatedly, especially ones
425
00:21:21.365 --> 00:21:22.765
that get repeated by programs.
426
00:21:24.045 --> 00:21:28.645
I, okay. Okay, I'm gonna move on
427
00:21:28.645 --> 00:21:29.885
to our fifth prompt.
428
00:21:33.865 --> 00:21:36.675
Just gonna copy and paste this in so that we can all see it.
429
00:21:50.105 --> 00:21:52.795
Okay, tip number five, I want you
430
00:21:52.795 --> 00:21:55.155
to think about back when you were onboarding a
431
00:21:55.155 --> 00:21:56.355
mem a new member to your team.
432
00:21:57.985 --> 00:21:59.395
When you onboard a new member
433
00:22:00.095 --> 00:22:01.915
and they're given a complex task,
434
00:22:01.915 --> 00:22:03.795
they're often not sure how to get started.
435
00:22:03.985 --> 00:22:05.115
It's very daunting.
436
00:22:06.055 --> 00:22:08.795
And what I do for new team members on my team is I start
437
00:22:09.075 --> 00:22:12.475
breaking up those, those complex tasks into smaller steps.
438
00:22:13.295 --> 00:22:17.315
So you can do that as well for your prompt here to the AI
439
00:22:17.975 --> 00:22:20.395
in this, in this case, I'm breaking telling it
440
00:22:20.395 --> 00:22:22.235
to follow these steps to write my report.
441
00:22:22.435 --> 00:22:25.115
I wanted to categorize the requests and the feedback.
442
00:22:25.835 --> 00:22:27.435
I wanted to count the frequencies.
443
00:22:28.955 --> 00:22:31.275
I like number three, I wanted to highlight the most
444
00:22:31.835 --> 00:22:33.115
frequent support requests
445
00:22:33.255 --> 00:22:36.275
and share with me any patterns that it's observing.
446
00:22:37.065 --> 00:22:39.395
Okay? I wanna assess the potential impact
447
00:22:39.455 --> 00:22:41.675
of each issue on the user experience.
448
00:22:42.955 --> 00:22:45.375
And I wanna separate the feedback into positive
449
00:22:45.475 --> 00:22:49.645
and negative pieces of feedback. Basically,
450
00:22:49.705 --> 00:22:52.645
Andrea, on this one, essentially the step-by-step is like
451
00:22:53.255 --> 00:22:56.885
debriefing a team member on all the context they need
452
00:22:56.885 --> 00:22:59.245
to give you exactly what you want, right? Yeah.
453
00:22:59.315 --> 00:23:00.485
It's getting really detailed.
454
00:23:00.635 --> 00:23:02.765
Like these are, these are the things that I'm hoping
455
00:23:02.785 --> 00:23:05.045
to see in this output so that I'm,
456
00:23:05.405 --> 00:23:06.565
'cause it's what I need in my report
457
00:23:06.785 --> 00:23:08.925
and it's what's gonna satisfy the original request.
458
00:23:09.185 --> 00:23:11.325
So it's really breaking it down into steps
459
00:23:11.345 --> 00:23:13.645
so it's more achievable, um,
460
00:23:13.745 --> 00:23:15.685
and leaves less to to chance here.
461
00:23:16.105 --> 00:23:18.725
So another term for this kind of prompting, Amanda,
462
00:23:19.105 --> 00:23:20.605
is chain of thought prompting.
463
00:23:20.745 --> 00:23:22.805
So if you ever hear that term when you're looking up,
464
00:23:22.805 --> 00:23:26.485
prompting, prompting tips, um, asking, giving it step
465
00:23:26.485 --> 00:23:27.485
by step instructions.
466
00:23:27.485 --> 00:23:29.485
It's called chain of thought prompting.
467
00:23:30.985 --> 00:23:34.045
So we also have a question. So for this one mm-Hmm.
468
00:23:34.125 --> 00:23:36.485
Um, someone asked, would you please comment
469
00:23:36.665 --> 00:23:39.725
how the configuration options to the right may be used?
470
00:23:39.945 --> 00:23:43.045
So if you wanted to walk through some of that as well.
471
00:23:43.135 --> 00:23:45.885
Great. Absolutely. Um, I, I can share some of that.
472
00:23:46.265 --> 00:23:49.725
Um, can I, Amanda, I'm gonna finish first with showing how,
473
00:23:49.905 --> 00:23:52.445
how the prompt the AI response to this prompt,
474
00:23:52.545 --> 00:23:53.725
and then we can talk about those
475
00:23:53.965 --> 00:23:55.285
configuration options on the right.
476
00:23:55.545 --> 00:23:57.965
Sounds good. Okay. Okay.
477
00:23:57.965 --> 00:24:00.165
So I'm gonna send this over to the, to the ai.
478
00:24:00.185 --> 00:24:02.365
And one of the reasons I really like chain
479
00:24:02.385 --> 00:24:04.165
of thought prompting is
480
00:24:04.165 --> 00:24:06.365
because ai, like I'm sending them the prompts,
481
00:24:06.525 --> 00:24:08.645
I don't know what's happening on happening over there,
482
00:24:08.665 --> 00:24:10.205
and then it's sending me this response.
483
00:24:10.225 --> 00:24:11.645
It feels almost like a black box,
484
00:24:12.065 --> 00:24:14.205
but when I'm able to tell it how I want it to think
485
00:24:14.205 --> 00:24:17.125
through the problem, I have a bit of an idea of
486
00:24:17.225 --> 00:24:18.525
how it's gonna work through the problem
487
00:24:18.545 --> 00:24:20.085
and return that response to me.
488
00:24:20.705 --> 00:24:25.565
Um, all right, let's start with our GPT-4 response.
489
00:24:26.595 --> 00:24:28.605
Okay. We can see I got my frequencies
490
00:24:29.545 --> 00:24:32.365
and you can see I have my an area now explaining the most
491
00:24:32.845 --> 00:24:35.485
frequent support requests and the patterns I was asking for,
492
00:24:35.705 --> 00:24:39.365
and also the impact of those issues on the user experience.
493
00:24:40.025 --> 00:24:41.725
So I'm getting a more meaningful
494
00:24:41.725 --> 00:24:43.485
and enriched report that I can really share
495
00:24:43.485 --> 00:24:47.445
with my executives and move things forward.
496
00:24:48.825 --> 00:24:50.925
So same with here with Google Gemini,
497
00:24:51.145 --> 00:24:53.925
you can see it really responded back with the summaries.
498
00:24:54.595 --> 00:24:56.125
It's highlighting the most frequent issues
499
00:24:56.125 --> 00:24:57.565
and patterns as I asked it to,
500
00:24:58.705 --> 00:25:00.845
and it's responding with that user experience
501
00:25:00.845 --> 00:25:03.605
that I wasn't getting with my earlier prompts.
502
00:25:04.595 --> 00:25:06.325
Okay, so let's shift over to some
503
00:25:06.325 --> 00:25:08.645
of these settings here on here on the right.
504
00:25:09.105 --> 00:25:12.285
So over in the playground, we expose these settings for you.
505
00:25:12.305 --> 00:25:13.405
So you have a lot of control
506
00:25:13.405 --> 00:25:15.165
and you can play with them to see how it changes
507
00:25:15.465 --> 00:25:19.405
how the model responds to your, to your prompt.
508
00:25:19.905 --> 00:25:21.605
And you're gonna see the settings are not
509
00:25:21.605 --> 00:25:22.845
identical for all the models.
510
00:25:22.945 --> 00:25:24.605
So it really depends on what the provider
511
00:25:24.625 --> 00:25:25.685
and the model support.
512
00:25:26.865 --> 00:25:31.485
So temperature, temperature directly controls randomness.
513
00:25:32.225 --> 00:25:35.205
So you can think of randomness as akin to creativity.
514
00:25:35.825 --> 00:25:38.725
How creative is the AI model gonna be in
515
00:25:38.885 --> 00:25:40.325
choosing the next word?
516
00:25:40.795 --> 00:25:43.525
Okay, so a higher temperature, if I move this higher,
517
00:25:44.055 --> 00:25:46.005
we're gonna get more creative.
518
00:25:46.595 --> 00:25:50.085
Okay? And if I keep it really kinda low, it's going
519
00:25:50.085 --> 00:25:53.045
to be less creative and it's gonna give more predictable
520
00:25:53.425 --> 00:25:54.845
and consistent output.
521
00:25:55.795 --> 00:25:58.765
Okay? So now top P is a little different.
522
00:25:59.065 --> 00:26:01.205
Um, generally I don't recommend using both
523
00:26:01.205 --> 00:26:02.605
temperature and top P together.
524
00:26:03.225 --> 00:26:07.765
Um, top P tells the model kind of how to select a pool
525
00:26:07.945 --> 00:26:11.565
of next words and then select the next word from that pool.
526
00:26:12.145 --> 00:26:13.765
So that's how it's handling kind
527
00:26:13.765 --> 00:26:15.245
of creativity and randomness.
528
00:26:16.965 --> 00:26:20.545
Now, this presence penalty, this is to encourage the model
529
00:26:20.605 --> 00:26:23.225
to include a more diverse range of tokens.
530
00:26:23.245 --> 00:26:26.705
Tokens are like three-fourths a word, and a higher value.
531
00:26:26.895 --> 00:26:28.905
It's gonna result in the model being more likely
532
00:26:28.905 --> 00:26:31.665
to generate, um, tokens.
533
00:26:31.685 --> 00:26:33.305
So words that haven't been used
534
00:26:33.705 --> 00:26:35.025
previously within the response.
535
00:26:36.585 --> 00:26:38.805
And then the frequency penalty.
536
00:26:40.065 --> 00:26:42.005
Um, the frequency penalty that's used
537
00:26:42.005 --> 00:26:44.125
to discourage the model from repeating the same words
538
00:26:44.125 --> 00:26:47.125
and phrases too often within the generated response.
539
00:26:47.665 --> 00:26:51.005
So a higher value is gonna result in the model being more
540
00:26:51.085 --> 00:26:54.365
conservative in its use of repeating the tokens.
541
00:26:54.745 --> 00:26:57.805
So there's a lot of overlap in how, how these things work.
542
00:26:57.865 --> 00:27:01.365
And really you can play with them to keep refining, um,
543
00:27:01.665 --> 00:27:03.445
how the quality of that response.
544
00:27:04.745 --> 00:27:05.765
Now my favorite,
545
00:27:06.585 --> 00:27:09.405
my favorite setting over here is number of responses.
546
00:27:09.865 --> 00:27:13.125
So I can change the number of responses and send a prompt.
547
00:27:13.265 --> 00:27:18.165
And if I have it set to two, I'm asking GBT four to send me,
548
00:27:19.385 --> 00:27:21.845
uh, to send two responses.
549
00:27:22.465 --> 00:27:24.885
And I can see how the two different responses differ.
550
00:27:25.665 --> 00:27:30.225
So I'm gonna copy
551
00:27:30.225 --> 00:27:31.465
and paste my last prompt.
552
00:27:33.915 --> 00:27:36.535
And quickly just send this over and ask for two responses
553
00:27:36.535 --> 00:27:39.055
and you'll see how I get two responses from g pt
554
00:27:39.055 --> 00:27:40.575
four to the same prompt.
555
00:27:51.085 --> 00:27:53.415
Okay. All right. So here's my first one.
556
00:27:53.615 --> 00:27:55.735
I think it's just taking a moment to return the second one,
557
00:27:55.735 --> 00:27:57.015
but it'll, when it's done, it should
558
00:27:57.015 --> 00:27:58.135
appear right here beneath.
559
00:27:58.835 --> 00:28:03.575
Um, Amanda, do we have time for some bonus, bonus tips
560
00:28:03.575 --> 00:28:05.055
after I give a quick run through
561
00:28:05.055 --> 00:28:06.535
of the five tips we went through already?
562
00:28:07.395 --> 00:28:10.815
We have about 10 minutes left for your section. Mm-Hmm.
563
00:28:11.105 --> 00:28:13.655
Great. So just to recap our five tips today,
564
00:28:13.825 --> 00:28:18.615
we're adding a persona, getting specific, adding delimiters
565
00:28:18.615 --> 00:28:21.655
and sections to your prompt, providing an example.
566
00:28:22.475 --> 00:28:25.695
And then our last one was listing out the steps you would
567
00:28:25.695 --> 00:28:27.295
like the generative AI model to take.
568
00:28:28.155 --> 00:28:30.175
So I'm gonna stop sharing my screen
569
00:28:37.045 --> 00:28:38.825
Did have a question about models,
570
00:28:38.875 --> 00:28:40.305
which I know you're gonna get to.
571
00:28:40.485 --> 00:28:42.385
And, uh, but they ask, curious
572
00:28:42.445 --> 00:28:44.745
to know if you have a favorite or most used model.
573
00:28:45.535 --> 00:28:50.185
Okay, I do. Um, GPT-4 Omni is my favorite model right now.
574
00:28:50.565 --> 00:28:54.155
Um, you know, Amanda, I'm having a hard time figuring out
575
00:28:54.215 --> 00:28:55.435
how to stop sharing.
576
00:28:56.695 --> 00:28:58.355
Uh, I think it's at the top of the screen.
577
00:28:58.505 --> 00:29:00.315
It's should I be? Thank you,
578
00:29:00.805 --> 00:29:01.805
Thank you. I per,
579
00:29:01.805 --> 00:29:02.355
580
00:29:02.645 --> 00:29:04.795
We're not a team organization, I mean,
581
00:29:04.825 --> 00:29:06.995
assumes the organization's. No worries. No,
582
00:29:07.335 --> 00:29:08.335
No. Okay.
583
00:29:08.335 --> 00:29:10.315
So I have some bonus tips today. Okay.
584
00:29:10.775 --> 00:29:14.755
Um, one is you may not wanna have to outline all the steps
585
00:29:14.775 --> 00:29:15.995
for the generative AI model,
586
00:29:16.015 --> 00:29:17.715
but you might really want it to think it through.
587
00:29:17.935 --> 00:29:20.155
So you can just simply add wording to your prompt.
588
00:29:20.155 --> 00:29:22.995
That is something like think through this step by step
589
00:29:23.095 --> 00:29:24.475
and the model's gonna slow down
590
00:29:24.735 --> 00:29:27.035
and it's gonna share with you its thought process,
591
00:29:27.295 --> 00:29:30.115
and it's gonna work through it in a, in a more kind
592
00:29:30.115 --> 00:29:34.195
of defined and methodol methodological kind of way.
593
00:29:35.415 --> 00:29:38.125
Bonus tip number two is when I write
594
00:29:38.125 --> 00:29:39.525
emails, I don't bury the lead.
595
00:29:39.605 --> 00:29:41.045
I put the most important piece first.
596
00:29:41.395 --> 00:29:43.125
Sometimes I repeat it at the end.
597
00:29:43.505 --> 00:29:44.845
You can do the same with your model.
598
00:29:44.845 --> 00:29:47.605
If you wanna make sure it does not miss this piece, it,
599
00:29:48.235 --> 00:29:49.565
it's going to concentrate there.
600
00:29:51.415 --> 00:29:53.315
Tip number three, and it's my last bonus tip.
601
00:29:54.265 --> 00:29:55.795
It's continue the conversation.
602
00:29:56.415 --> 00:29:58.275
So you do not have to send a mega prompt,
603
00:29:58.435 --> 00:30:00.635
a really large prompt with everything in it the way
604
00:30:00.635 --> 00:30:02.715
that we were doing today, just for speed.
605
00:30:03.335 --> 00:30:06.515
Um, you can break it up into smaller prompts
606
00:30:06.895 --> 00:30:10.435
and kind of continue informing the model to make tweaks
607
00:30:10.495 --> 00:30:13.155
to the same output so you can have the model kind
608
00:30:13.155 --> 00:30:14.395
of just keep amending and adding
609
00:30:14.415 --> 00:30:15.595
to the output that it's creating.
610
00:30:16.655 --> 00:30:19.195
Amanda, do we have time to go into some model comparison
611
00:30:19.255 --> 00:30:21.235
to answer some of those questions that were coming through?
612
00:30:21.895 --> 00:30:24.235
We do. And we have a question. I really like this one.
613
00:30:24.375 --> 00:30:27.915
So, uh, John asked, how specific do you find it useful
614
00:30:27.975 --> 00:30:30.395
to get in terms of the context persona
615
00:30:30.395 --> 00:30:31.515
at the beginning of the prompt?
616
00:30:33.055 --> 00:30:35.015
I find it really useful. Yeah.
617
00:30:35.035 --> 00:30:38.555
Um, so for example,
618
00:30:40.565 --> 00:30:45.105
if I fed, if I fed that generative AI a piece of text
619
00:30:45.165 --> 00:30:49.625
and I said edit, edit this text, they may rewrite it,
620
00:30:49.775 --> 00:30:51.625
they may take awkward sentences and make them better.
621
00:30:51.625 --> 00:30:53.705
But you know what? I had specific wording in there.
622
00:30:53.985 --> 00:30:55.985
I had things exactly the way I wanted it.
623
00:30:56.265 --> 00:30:58.345
I really just wanted it to find my grammar issues.
624
00:30:58.925 --> 00:31:00.745
Um, and I find by giving its persona
625
00:31:00.745 --> 00:31:02.465
and telling its role, it's going to,
626
00:31:02.575 --> 00:31:05.825
it's gonna do a better job at meeting my expectations. Um,
627
00:31:06.825 --> 00:31:07.825
I definitely agree.
628
00:31:08.365 --> 00:31:10.025
Um, yeah, we do have time, uh,
629
00:31:10.025 --> 00:31:11.825
we've got about 10 minutes if you wanna go
630
00:31:11.825 --> 00:31:12.985
through model comparisons.
631
00:31:13.615 --> 00:31:14.825
Yeah. Okay.
632
00:31:15.165 --> 00:31:18.145
So we just saw in real time how different models responded
633
00:31:18.145 --> 00:31:21.545
to the same prompts in that, in the Spark AI playground.
634
00:31:22.245 --> 00:31:24.575
Um, I'd like to talk about some of those,
635
00:31:24.575 --> 00:31:27.095
those main differences and models fall into kind
636
00:31:27.095 --> 00:31:28.415
of three major categories.
637
00:31:28.955 --> 00:31:32.415
So the first is decoder models, and that's like GPT,
638
00:31:32.955 --> 00:31:35.095
and they're really good at generating content.
639
00:31:35.635 --> 00:31:39.015
So why is GPT-4 omni my favorite model right now?
640
00:31:40.035 --> 00:31:43.175
I'm finding it's just really good at text completion tasks,
641
00:31:43.175 --> 00:31:45.815
especially generating test questions that I need
642
00:31:45.815 --> 00:31:47.575
for like demos that I'm doing for clients.
643
00:31:47.955 --> 00:31:50.695
So it's really my go-to for generating content.
644
00:31:52.275 --> 00:31:55.935
So also within the decoder kind of category of models,
645
00:31:56.555 --> 00:31:59.615
you have models like AI 21 Labs Jurassic,
646
00:31:59.615 --> 00:32:01.255
and I'm hearing really good things about
647
00:32:01.255 --> 00:32:02.735
that model for translation.
648
00:32:03.995 --> 00:32:07.615
So you also have, um, anthros, Claude
649
00:32:07.795 --> 00:32:09.655
and I did have a, you know,
650
00:32:10.175 --> 00:32:12.935
feedback from a colleague on Claude sharing
651
00:32:12.935 --> 00:32:15.255
that this model was particularly good at creative writing.
652
00:32:15.395 --> 00:32:18.215
So they were seeing great uses for it in the K to 12 space
653
00:32:18.215 --> 00:32:19.535
where you're writing kind of creative
654
00:32:19.535 --> 00:32:20.855
writing for reading comprehension.
655
00:32:22.085 --> 00:32:23.415
Then you have another category
656
00:32:23.415 --> 00:32:25.175
of models, um, encoder models.
657
00:32:25.795 --> 00:32:27.575
And these models like Bert,
658
00:32:28.205 --> 00:32:30.415
they're really good at understanding relationships
659
00:32:30.875 --> 00:32:32.935
and they're, they're models you may wanna look into if
660
00:32:32.935 --> 00:32:35.735
you're looking to analyze data and categorize data
661
00:32:35.735 --> 00:32:38.495
and classify data so they have strengths in that area.
662
00:32:39.865 --> 00:32:43.405
The third main category is your encoder decoder models.
663
00:32:43.435 --> 00:32:46.645
They're both, so they have strengths in both kind of
664
00:32:46.645 --> 00:32:48.925
that noticing relationships and generating content.
665
00:32:49.505 --> 00:32:51.405
And these are models like Google Gemini,
666
00:32:51.405 --> 00:32:52.525
which you saw working today.
667
00:32:53.895 --> 00:32:56.915
Um, now AI providers, they are eager
668
00:32:56.915 --> 00:32:58.315
to tell you about their models.
669
00:32:58.655 --> 00:33:00.275
So if you just Google the provider
670
00:33:00.275 --> 00:33:02.435
and you go to their main site, they're gonna tell you
671
00:33:02.435 --> 00:33:04.195
what they think, each model that they,
672
00:33:04.385 --> 00:33:06.835
they have on offer is particularly good at.
673
00:33:07.655 --> 00:33:10.235
And then after you read that, come to the AI playground
674
00:33:10.575 --> 00:33:11.795
and then select those models
675
00:33:11.935 --> 00:33:16.035
and see how they, how they compare when you send them your
676
00:33:16.275 --> 00:33:17.475
objectives and your tasks.
677
00:33:17.535 --> 00:33:20.595
So which model is the best for your purposes?
678
00:33:21.575 --> 00:33:23.515
So it's one of the great benefits
679
00:33:23.515 --> 00:33:26.035
of having the AI playground available both internally
680
00:33:26.215 --> 00:33:30.515
and now externally to our clients after performance.
681
00:33:30.515 --> 00:33:31.755
There's a couple other things that,
682
00:33:31.755 --> 00:33:32.795
that you may wanna consider.
683
00:33:33.255 --> 00:33:34.915
Um, one would be cost.
684
00:33:35.295 --> 00:33:38.555
So the, the AI providers do charge a cost
685
00:33:38.815 --> 00:33:40.195
for using their models,
686
00:33:40.615 --> 00:33:43.795
and they charge a cost for the size of the input.
687
00:33:43.815 --> 00:33:44.915
So the size of your prompt,
688
00:33:45.375 --> 00:33:47.875
and then also the size of the response that's coming back.
689
00:33:48.655 --> 00:33:52.915
Now, what we found, um, working with models for a IG is
690
00:33:52.915 --> 00:33:54.475
that the cost range for the models
691
00:33:54.475 --> 00:33:56.115
that we support wasn't that huge.
692
00:33:56.615 --> 00:33:59.035
Um, one of my colleagues, Chris Glackin, found
693
00:33:59.105 --> 00:34:01.995
that we could generate 2000 multiple choice questions
694
00:34:01.995 --> 00:34:03.395
for less than $32.
695
00:34:04.065 --> 00:34:05.915
Okay? So I, I personally find cost
696
00:34:05.915 --> 00:34:07.315
as a lower, lower concern.
697
00:34:08.795 --> 00:34:11.135
Now, these different providers provide different support
698
00:34:11.175 --> 00:34:13.815
toolings, so it's something to go look at on their website.
699
00:34:14.475 --> 00:34:17.695
Um, you may be interested in say fine tuning a model,
700
00:34:17.915 --> 00:34:21.095
and some of these providers have tools to help you do that.
701
00:34:21.355 --> 00:34:23.095
So what is fine tuning a model?
702
00:34:23.685 --> 00:34:26.095
Fine tuning a model is when you have a data set,
703
00:34:26.355 --> 00:34:29.935
an extra data set, that's your domain, that's your data,
704
00:34:30.755 --> 00:34:32.495
and you want to feed it to one
705
00:34:32.495 --> 00:34:37.135
of these foundation models like GBT four or Gemini.
706
00:34:37.675 --> 00:34:39.815
So you wanna do further training on those models so
707
00:34:39.815 --> 00:34:43.735
that it can specialize in your tasks and your terminology.
708
00:34:44.765 --> 00:34:48.095
Okay. So there's different costs for using those tools
709
00:34:48.355 --> 00:34:49.655
and fine tuning a model
710
00:34:49.675 --> 00:34:51.655
and those fine tuned models, they then become
711
00:34:51.655 --> 00:34:52.895
available just for your use.
712
00:34:53.355 --> 00:34:56.135
So Amanda, didn't our colleagues do a, a webinar
713
00:34:56.855 --> 00:35:01.565
recently on fine tuning and rag and those ideas? Yeah, two,
714
00:35:01.665 --> 00:35:05.085
Two weeks ago, um, they led one, one rag and fine tuning
715
00:35:05.185 --> 00:35:06.765
and it was super in depth.
716
00:35:07.265 --> 00:35:10.125
Um, that is on the website if anyone wants to revisit that.
717
00:35:10.145 --> 00:35:12.685
And they show the demo of how to actually do it,
718
00:35:13.225 --> 00:35:15.005
um, which is really cool.
719
00:35:15.305 --> 00:35:17.165
You do have a question in the chat if
720
00:35:17.165 --> 00:35:18.525
you want to answer this.
721
00:35:18.615 --> 00:35:20.485
We've got four repetitive tasks.
722
00:35:21.025 --> 00:35:23.565
Do you recom do you recommend on how
723
00:35:23.585 --> 00:35:26.965
to retain the prompt instructions for multiple requests?
724
00:35:27.705 --> 00:35:31.365
How to retain it? Yeah. Okay. Oh, that's a good question.
725
00:35:32.505 --> 00:35:33.845
That's a really good question.
726
00:35:34.545 --> 00:35:37.325
Um, so it depends what that repetitive task is
727
00:35:37.325 --> 00:35:38.445
and who's calling it.
728
00:35:38.865 --> 00:35:42.405
Um, so I know I work, I work a, a set of product managers
729
00:35:42.665 --> 00:35:46.605
and technical writer that keep a notebook of their tasks
730
00:35:47.225 --> 00:35:50.525
and they even write down how the response was.
731
00:35:50.585 --> 00:35:51.685
So they have their preferred tasks
732
00:35:51.685 --> 00:35:53.165
and the ones that are, uh, prompts
733
00:35:53.165 --> 00:35:55.605
and what's working for them so they can go back to them.
734
00:35:55.785 --> 00:35:57.285
So that's at an individual level.
735
00:35:58.025 --> 00:36:00.125
Um, and then organizations, I know a lot
736
00:36:00.125 --> 00:36:02.485
of them are coming up with their own handbooks
737
00:36:02.585 --> 00:36:05.205
and support materials, so they may have some input in
738
00:36:05.205 --> 00:36:08.445
how they want you to store off your, your prompts
739
00:36:08.745 --> 00:36:11.445
and share them with the remainder of your organization. Um,
740
00:36:11.635 --> 00:36:15.445
Yeah, from, um, in marketing we have a, a prompt library,
741
00:36:15.655 --> 00:36:16.805
which sounds really fancy,
742
00:36:16.945 --> 00:36:19.045
but it's just an Excel sheet sheet that says,
743
00:36:19.105 --> 00:36:22.525
here's the prompt we used, here's the use case for it so
744
00:36:22.525 --> 00:36:24.805
that we can come back if we like it, you know,
745
00:36:24.805 --> 00:36:26.485
if it worked well to use it again.
746
00:36:26.865 --> 00:36:29.205
Um, so we're not always starting from scratch, especially
747
00:36:29.205 --> 00:36:30.445
for those longer prompts.
748
00:36:34.025 --> 00:36:36.045
That's a great idea. Amanda, there
749
00:36:36.045 --> 00:36:37.605
Was another question that I'm not totally
750
00:36:37.755 --> 00:36:39.245
sure, um, how to answer.
751
00:36:39.425 --> 00:36:41.725
So I will pass it to you to see what you think.
752
00:36:42.225 --> 00:36:45.005
Um, are there plans to allow users
753
00:36:45.265 --> 00:36:48.565
to use report data from program workshop, um,
754
00:36:48.665 --> 00:36:50.125
in the AI playground?
755
00:36:50.375 --> 00:36:51.485
Which is a great question.
756
00:36:51.595 --> 00:36:54.285
Okay, that's, that's an incredible question. Yeah.
757
00:36:54.585 --> 00:36:56.205
Um, I am gonna have to share
758
00:36:56.205 --> 00:36:58.125
that I am an item workshop product manager,
759
00:36:58.305 --> 00:37:01.325
so I don't work on program workshop, so I can make sure
760
00:37:01.325 --> 00:37:02.845
that question gets over to,
761
00:37:03.345 --> 00:37:05.085
to the product managers in my department
762
00:37:05.085 --> 00:37:06.245
that work on that application.
763
00:37:06.645 --> 00:37:10.065
'cause to be frank, I'm not sure on the specifics
764
00:37:10.065 --> 00:37:12.305
of their roadmap and what they have planning.
765
00:37:15.515 --> 00:37:16.575
That's great. Those are the
766
00:37:16.575 --> 00:37:17.855
two questions we have right now.
767
00:37:18.515 --> 00:37:20.655
Um, we do have a few more minutes if there
768
00:37:20.655 --> 00:37:21.935
was anything else you wanted to cover.
769
00:37:22.395 --> 00:37:24.495
Um, otherwise we can open it up to q and a.
770
00:37:24.515 --> 00:37:25.695
If you have a question,
771
00:37:25.695 --> 00:37:27.695
you can either reuse the raise your hand feature,
772
00:37:27.705 --> 00:37:31.135
which should be available to you, um, or put it in the chat.
773
00:37:32.105 --> 00:37:34.775
Thank you, Amanda. This has been so much fun. This
774
00:37:34.775 --> 00:37:35.775
Has been really great.
775
00:37:36.235 --> 00:37:38.935
Um, was there anything else that you maybe wanted
776
00:37:38.935 --> 00:37:40.415
to cover today or,
777
00:37:40.475 --> 00:37:41.975
it seems like we went through a whole lot.
778
00:37:43.045 --> 00:37:45.095
Yeah, I'm, I think I shared everything
779
00:37:45.095 --> 00:37:46.175
that I had on my mind today,
780
00:37:46.175 --> 00:37:47.535
so I'd love to know if there's any questions,
781
00:37:49.845 --> 00:37:50.845
Hand raises.
782
00:37:52.085 --> 00:37:53.415
Okay. No.
783
00:37:53.925 --> 00:37:56.495
Well then with that I will just do a quick closing
784
00:37:56.635 --> 00:37:59.295
and say thank you to everyone for being here with us.
785
00:37:59.755 --> 00:38:02.375
Uh, we do have part four of our sessions,
786
00:38:02.375 --> 00:38:04.375
our live sessions going live next week.
787
00:38:04.725 --> 00:38:06.255
It's August 6th at 12.
788
00:38:06.725 --> 00:38:08.775
That session is on data analysis
789
00:38:08.955 --> 00:38:10.735
and optimizing survey results.
790
00:38:10.955 --> 00:38:13.015
So if you have candidates that are taking a survey,
791
00:38:13.155 --> 00:38:14.735
you really don't wanna miss this one.
792
00:38:15.195 --> 00:38:16.655
Um, it is always recorded,
793
00:38:16.755 --> 00:38:18.295
but we hope that you join us live,
794
00:38:18.295 --> 00:38:19.895
especially if you have any questions.
795
00:38:20.435 --> 00:38:22.815
Um, that's with Jeffrey Za, he's our director
796
00:38:22.815 --> 00:38:26.215
of product management and create both our product manager,
797
00:38:27.035 --> 00:38:31.655
um, you would visit testis.com/webinars to sign up,
798
00:38:31.955 --> 00:38:34.255
um, and see all the previous recordings as well.
799
00:38:35.105 --> 00:38:38.805
So with that, um, oh, we had a good thank you.
800
00:38:38.985 --> 00:38:40.805
You're welcome. Mark, thanks for joining us.
801
00:38:41.425 --> 00:38:44.085
Um, and again, if you have questions, just reach out.
802
00:38:44.465 --> 00:38:46.285
Uh, thank you for being here with us today,
803
00:38:46.345 --> 00:38:47.845
and we will see you at the next session.
804
00:38:47.935 --> 00:38:51.485
Thank you, Andrea. Thank you. Bye everyone.