WEBVTT

00:00:00.000 --> 00:00:15.970 align:middle line:90%


00:00:15.970 --> 00:00:18.970 align:middle line:84%
Hello, and welcome
to a quick tutorial

00:00:18.970 --> 00:00:23.620 align:middle line:84%
that is going to take a look at
the textbook dataset in Excel.

00:00:23.620 --> 00:00:25.540 align:middle line:84%
Now, before we get
started, I do want

00:00:25.540 --> 00:00:28.630 align:middle line:84%
to talk a little bit about Excel
and how Excel is set up it.

00:00:28.630 --> 00:00:30.220 align:middle line:90%
It is a giant spreadsheet.

00:00:30.220 --> 00:00:31.930 align:middle line:90%
For our purposes, the columns--

00:00:31.930 --> 00:00:34.090 align:middle line:84%
as you can see here, I'm
pointing to the columns--

00:00:34.090 --> 00:00:36.580 align:middle line:84%
each of these are going
to be individual items.

00:00:36.580 --> 00:00:39.130 align:middle line:84%
Now, these are going to
correspond with the survey

00:00:39.130 --> 00:00:41.380 align:middle line:84%
that we collected for
our book, Quantitative

00:00:41.380 --> 00:00:44.260 align:middle line:84%
Research for Communication:
A Hands-On Approach.

00:00:44.260 --> 00:00:46.180 align:middle line:84%
So these are the
individual items.

00:00:46.180 --> 00:00:49.780 align:middle line:84%
Now, the rows are going to be,
starting with number 2 here,

00:00:49.780 --> 00:00:52.450 align:middle line:84%
are the participants
who completed this.

00:00:52.450 --> 00:00:54.860 align:middle line:84%
For example, we have
person number 1 right here.

00:00:54.860 --> 00:00:56.830 align:middle line:84%
That's why you see the
ID next to their name.

00:00:56.830 --> 00:00:59.560 align:middle line:84%
That was the order they were
entered and we collected them.

00:00:59.560 --> 00:01:03.520 align:middle line:84%
So their first score for the
very first item on the PRCA-24

00:01:03.520 --> 00:01:04.629 align:middle line:90%
was a 2.

00:01:04.629 --> 00:01:07.840 align:middle line:84%
Their last score, if we
come way over here, was a 1.

00:01:07.840 --> 00:01:10.150 align:middle line:84%
So you can see that
they have 24 items.

00:01:10.150 --> 00:01:14.440 align:middle line:84%
Here is the entire
survey for the PRCA-24.

00:01:14.440 --> 00:01:17.200 align:middle line:84%
All 24 of those items,
when added together,

00:01:17.200 --> 00:01:20.530 align:middle line:84%
create the composite score for
communication apprehension,

00:01:20.530 --> 00:01:24.400 align:middle line:84%
or the Personal Report of
Communication Apprehension 24.

00:01:24.400 --> 00:01:26.870 align:middle line:84%
And you can go through
and look at that.

00:01:26.870 --> 00:01:28.750 align:middle line:84%
We also come over
to the right, where

00:01:28.750 --> 00:01:30.490 align:middle line:84%
we start seeing some
of the variables

00:01:30.490 --> 00:01:32.600 align:middle line:90%
that we would actually utilize.

00:01:32.600 --> 00:01:35.920 align:middle line:84%
For example, you're going to see
here starting with sex, the sex

00:01:35.920 --> 00:01:37.330 align:middle line:90%
column right here.

00:01:37.330 --> 00:01:38.560 align:middle line:90%
This is biological sex.

00:01:38.560 --> 00:01:41.140 align:middle line:84%
Then we have their political
affiliation, their class,

00:01:41.140 --> 00:01:43.150 align:middle line:84%
how much time they spend
online, their age, what

00:01:43.150 --> 00:01:45.410 align:middle line:84%
edition of the book
was the data collected.

00:01:45.410 --> 00:01:47.170 align:middle line:84%
Then we get to these
composite scores.

00:01:47.170 --> 00:01:49.780 align:middle line:84%
So we were looking at those
24 items for the Communication

00:01:49.780 --> 00:01:51.250 align:middle line:90%
Apprehension 24.

00:01:51.250 --> 00:01:53.020 align:middle line:90%
This is the composite score.

00:01:53.020 --> 00:01:56.110 align:middle line:84%
So for that very first
person, their composite score

00:01:56.110 --> 00:01:59.440 align:middle line:84%
when you added up all
their items was 61.

00:01:59.440 --> 00:02:01.960 align:middle line:84%
So this is kind of
the basic layout.

00:02:01.960 --> 00:02:04.690 align:middle line:84%
Now, for our purposes, what
we are going to be using

00:02:04.690 --> 00:02:07.706 align:middle line:84%
is a great little plug-in
called Statistician.

00:02:07.706 --> 00:02:09.789 align:middle line:84%
Now, I'm going to come
over here to their website.

00:02:09.789 --> 00:02:11.014 align:middle line:90%
This is Statistician.

00:02:11.014 --> 00:02:13.180 align:middle line:84%
And one of the things that
I like about Statistician

00:02:13.180 --> 00:02:15.250 align:middle line:90%
is, it's very easy to use.

00:02:15.250 --> 00:02:18.580 align:middle line:84%
And honestly, it's probably one
of the cheapest plug-ins I've

00:02:18.580 --> 00:02:20.590 align:middle line:90%
seen specifically for Excel.

00:02:20.590 --> 00:02:23.860 align:middle line:84%
The plug-in itself
is only $19.95.

00:02:23.860 --> 00:02:26.380 align:middle line:84%
Which for plug-ins in
Excel for statistics

00:02:26.380 --> 00:02:29.840 align:middle line:84%
is actually very,
very, very inexpensive.

00:02:29.840 --> 00:02:33.580 align:middle line:84%
So you can actually go to
their website at stataddin.com.

00:02:33.580 --> 00:02:35.420 align:middle line:84%
There are some other
ones that exist.

00:02:35.420 --> 00:02:38.620 align:middle line:84%
But they are going to be a lot
more expensive, which is why

00:02:38.620 --> 00:02:40.480 align:middle line:90%
we are looking at Statistician.

00:02:40.480 --> 00:02:44.331 align:middle line:84%
There are also some things
that you can get right in Excel

00:02:44.331 --> 00:02:44.830 align:middle line:90%
itself.

00:02:44.830 --> 00:02:46.690 align:middle line:84%
There is their
Data Functions tab.

00:02:46.690 --> 00:02:49.780 align:middle line:84%
Which you can download that
from the Office website.

00:02:49.780 --> 00:02:52.729 align:middle line:84%
So there are a lot of things
that are built into Excel.

00:02:52.729 --> 00:02:54.520 align:middle line:84%
You can look at all
the different formulas.

00:02:54.520 --> 00:02:56.740 align:middle line:84%
Like, if you come
here, you can look.

00:02:56.740 --> 00:02:58.150 align:middle line:90%
They have a bunch of formulas.

00:02:58.150 --> 00:03:00.280 align:middle line:84%
They even have some
stats formulas that

00:03:00.280 --> 00:03:02.120 align:middle line:90%
are actually built into this.

00:03:02.120 --> 00:03:04.030 align:middle line:90%
So that's nice too.

00:03:04.030 --> 00:03:07.450 align:middle line:84%
But again, I'm going to show you
why that Statistician add-in is

00:03:07.450 --> 00:03:09.650 align:middle line:84%
going to be your
friend later on.

00:03:09.650 --> 00:03:12.880 align:middle line:84%
Now, there are some inherent
limitations to Excel.

00:03:12.880 --> 00:03:15.880 align:middle line:84%
One of the biggest ones
is, it doesn't do well

00:03:15.880 --> 00:03:17.620 align:middle line:90%
when it comes to missing data.

00:03:17.620 --> 00:03:21.170 align:middle line:84%
Let's come down and look
at person number 22.

00:03:21.170 --> 00:03:23.800 align:middle line:84%
So you're going to see that
they didn't fill out pretty much

00:03:23.800 --> 00:03:25.690 align:middle line:90%
most of the PRCA-24.

00:03:25.690 --> 00:03:27.550 align:middle line:84%
They kind of started
only filling things out

00:03:27.550 --> 00:03:29.950 align:middle line:90%
when it came to ethnocentrism.

00:03:29.950 --> 00:03:32.380 align:middle line:84%
And so we end up with a
case where there's just

00:03:32.380 --> 00:03:34.890 align:middle line:90%
missing information from them.

00:03:34.890 --> 00:03:38.710 align:middle line:84%
So unfortunately, one of the
things that becomes problematic

00:03:38.710 --> 00:03:42.076 align:middle line:84%
is, Excel does not know how
to handle missing information.

00:03:42.076 --> 00:03:44.200 align:middle line:84%
There are some other programs
that you can download

00:03:44.200 --> 00:03:45.770 align:middle line:90%
that are very expensive.

00:03:45.770 --> 00:03:48.040 align:middle line:84%
But it's going to
inherently cause problems.

00:03:48.040 --> 00:03:50.890 align:middle line:84%
Because we just don't have
the ability to handle that.

00:03:50.890 --> 00:03:52.850 align:middle line:84%
For that reason,
this whole giant data

00:03:52.850 --> 00:03:54.940 align:middle line:84%
set that I have right
here in front of me

00:03:54.940 --> 00:03:59.020 align:middle line:84%
would not be able to be
analyzed easily in Excel.

00:03:59.020 --> 00:04:01.080 align:middle line:84%
So instead, one of the
things that I have here

00:04:01.080 --> 00:04:06.700 align:middle line:84%
is what is called the
shortened no missing data.

00:04:06.700 --> 00:04:09.340 align:middle line:84%
Basically, what the shortened
no missing data has done is,

00:04:09.340 --> 00:04:13.540 align:middle line:84%
I've gone through and deleted
any type of missing data

00:04:13.540 --> 00:04:15.700 align:middle line:90%
that existed within our file.

00:04:15.700 --> 00:04:17.649 align:middle line:90%
Now, here is the reality.

00:04:17.649 --> 00:04:20.974 align:middle line:84%
If we come over here to
the full textbook dataset--

00:04:20.974 --> 00:04:22.390 align:middle line:84%
scroll all the way
down, and we'll

00:04:22.390 --> 00:04:23.764 align:middle line:84%
see how many
participants we had.

00:04:23.764 --> 00:04:26.840 align:middle line:90%
We had 654 participants.

00:04:26.840 --> 00:04:28.990 align:middle line:84%
I come over here to
the shortened version.

00:04:28.990 --> 00:04:33.130 align:middle line:84%
You're going to see that
it drops down to 550.

00:04:33.130 --> 00:04:36.670 align:middle line:84%
So that means that, out
of the 654, almost 100

00:04:36.670 --> 00:04:41.780 align:middle line:84%
of our participants missed
at least one item somewhere.

00:04:41.780 --> 00:04:44.800 align:middle line:84%
And as such, they had
to be tossed out so

00:04:44.800 --> 00:04:48.042 align:middle line:84%
that Excel could be able
to handle these results.

00:04:48.042 --> 00:04:50.500 align:middle line:84%
And so it is one of those things
that's really frustrating.

00:04:50.500 --> 00:04:54.070 align:middle line:84%
That's also why, when it
comes to having complete data,

00:04:54.070 --> 00:04:55.300 align:middle line:90%
it really is useful.

00:04:55.300 --> 00:04:57.970 align:middle line:84%
Or to have a very, very
large robust data set.

00:04:57.970 --> 00:05:00.730 align:middle line:84%
So that if you do have to
throw out participants,

00:05:00.730 --> 00:05:04.900 align:middle line:84%
you're not going to see too
much change in your result.

00:05:04.900 --> 00:05:06.664 align:middle line:90%
But that is a factor.

00:05:06.664 --> 00:05:08.080 align:middle line:84%
And this is also
going to be where

00:05:08.080 --> 00:05:09.788 align:middle line:84%
some of the results
that we'll talk about

00:05:09.788 --> 00:05:11.980 align:middle line:84%
will greatly differ
from what we actually

00:05:11.980 --> 00:05:15.610 align:middle line:84%
see in the textbook, where we
were able to use missing data.

00:05:15.610 --> 00:05:18.460 align:middle line:84%
Which is, again, just
something that Excel itself

00:05:18.460 --> 00:05:21.850 align:middle line:84%
does not have the
ability to utilize.

00:05:21.850 --> 00:05:24.584 align:middle line:84%
So I recommend using the
textbook data shortened.

00:05:24.584 --> 00:05:26.500 align:middle line:84%
One of the other things
you're going to notice

00:05:26.500 --> 00:05:29.410 align:middle line:84%
is, I got rid of the
individual items.

00:05:29.410 --> 00:05:31.914 align:middle line:84%
So when we go back up
here to the very top,

00:05:31.914 --> 00:05:33.580 align:middle line:84%
you'll see all of
these individual items

00:05:33.580 --> 00:05:35.050 align:middle line:90%
for each of the scales.

00:05:35.050 --> 00:05:37.630 align:middle line:84%
I just went and started
this dataset with sex.

00:05:37.630 --> 00:05:39.899 align:middle line:84%
And then I get into
those summed scores.

00:05:39.899 --> 00:05:41.440 align:middle line:84%
So this is just a
little bit cleaner.

00:05:41.440 --> 00:05:43.270 align:middle line:84%
It's a little bit
easier to look at.

00:05:43.270 --> 00:05:44.770 align:middle line:84%
And so it's definitely
going to make

00:05:44.770 --> 00:05:47.680 align:middle line:84%
our lives a lot easier when
trying to handle statistics

00:05:47.680 --> 00:05:49.620 align:middle line:90%
using Excel.

00:05:49.620 --> 00:06:03.112 align:middle line:90%