# Packages used in this analysis.
# Deduplicated: repeated library() calls are no-ops, so the duplicate loads
# (ggplot2, ggeffects, margins, effectsize, aod, readr, ggcorrplot, glmnet,
# ggpubr, lme4) are dropped without changing the search path. require() is
# intended for conditional use inside functions (it returns FALSE instead of
# erroring), so library() is used throughout. Original attach order is kept
# so that function masking is unchanged.
library(corrr)
library(psych)
library(lavaan)
library(dplyr)
library(tidyr)
library(ggplot2)
library(haven)
library(rempsyc)
library(broom)
library(report)
library(effectsize)
library(aod)
library(readr)
library(forcats)
library(ggcorrplot)
library(caret)
library(knitr)
library(ROCR)
library(jtools)
library(xtable)
library(glmnet)
library(ggpubr)
library(lme4)
library(nlme)
library(weights)
library(miscTools)
library(systemfit)
library(multcomp)
library(GGally)
library(reshape2)
library(lattice)
library(HLMdiag)
library(margins)
library(performance)
library(ggnewscale)
library(ggeffects)
library(marginaleffects)
library(effects)
library(modelr)
library(plm)
library(tidymodels)
library(foreign)
library(AER)
library(formatR)
library(pglm)
library(acqr)
library(lmtest)
library(poLCA)
library(mirt)
library(texreg)
library(gt)
Device Divide
Data Analysis
In this project, I focused on analyzing how mental health relates to mobile banking adoption. I used data from the Canadian Internet Use Survey 2022, which includes questions about various digital habits, and demographics. You can find the dataset here.
For this project, I conducted a comparative analysis of two logistic regression models: one for smartphone users, referred to from here on out as PHONE users, and one for smart wearable users, referred to as WEAR users. Since the sampling methodology involved clustering by provinces, I considered robust standard errors for reporting the results. To build the models, I needed to conceptualize technology adoption, in this case, m-banking adoption. Since I’m considering two different devices, I needed factors that impact m-banking decisions for both devices to be able to compare them.
This was very challenging because there are not many studies of m-banking on smart wearable devices! So, I broadened the scope to consider any technology adoption. This is ok to do as long as the factors are not specific to a niche context. The variables are Trust, Perceived Security, Perceived Value, and a few demographic variables such as Age, Gender, Education and Income.
Here are my hypotheses:
- H1: The association between Trust and m-banking adoption is the same for smartphone and smart wearable users
- H2: The association between Perceived security and m-banking adoption is the same for smartphone and smart wearable users.
- H3: The association between Perceived value (measured by time savings) and m-banking adoption is the same for smartphone and smart wearable users.
- H4.1 : The association between Age and m-banking adoption is the same for smartphone and smart wearable users.
- H4.2 : The association between Gender and m-banking adoption is the same for smartphone and smart wearable users.
- H4.3 : The association between Education and m-banking adoption is the same for smartphone and smart wearable users.
- H4.4 : The association between Income and m-banking adoption is the same for smartphone and smart wearable users.
Importing Libraries
Note that not all libraries may be utilized. The most important ones are dplyr
, lme4
, tidyr
, lavaan
, ggplot2
, psych
, corrr
, haven
, poLCA
and any related libraries to these.
Introducing the CIUS 2022
This dataset is very similar to CIUS 2020 from study 2. I first started by reading the entire PUMF file available.
This gives you information on how the survey was set up, why, and how things were measured. Then, I looked at the individual survey questions to see the available data, and how they were measured. In general, questions are measured numerically, where answers are coded as follows:
Yes : 1 No : 2
Valid Skip: 6 Don’t Know: 7 Refusal: 8 Not Stated: 9
Of course this differs question-by-question as some questions have other answer categories and some questions (which were not used in my study) asked for numerical input from the participants (like how much did you spend online last year). To help readers understand the data, I will include the question exactly as it appears in the CIUS 2022 PUMF Data Dictionary with corresponding answer choices and codes. These will be in a blue-bordered box, and will include the Variable name (on the PUMF file), Concept, Question Body and Answers. Then I will show you in R code how I’ve re-coded and used the question as a model variable. The Variables I need are as follows:
- Mobile banking adoption (
MBANK
) - Province (
PRVNC
) - Age Group (
AGE
) - Gender (
SEX
) - Education Level (
EDU
) - Income Quintile (
INCOME
) - User Type (
USR_TYP
) - based on the following- Smartphone User (
isSmartPhone
) - Smartwearable User (
isSmartWear
)
- Smartphone User (
- Saved Time Because of m-banking (
EFF_TIME
) - Perceived Security (
PSEC
) - based on the following- Security measure: restricting access to location (
SEC_RES_LOC
) - Security measure: restricting access to data (
SEC_RES_DAT
) - Security check: checked security of a website (
SEC_ACC_WEBSEC
) - Security check: changed privacy settings (
SEC_ACC_CHNGPRV
) - Security feature: security questions (
SECOPT_QS
) - Security feature: partner login (
SECOPT_PL
) - Security feature: two factor authentication (
SECOPT_2FA
) - Security feature: biometric (
SECOPT_BIO
) - Security feature: password manager (
SECOPT_PAS
)
- Security measure: restricting access to location (
- Trust in Banks (
TRST_BANK
) - Family Relation Satisfaction (
FAMSAT
)
The data is available in various formats. To avoid data loss, I decided to use the .dta
format (a Stata file). You need the haven
package to read Stata files. This is how you’d read a Stata file:
# Read the CIUS 2022 PUMF. The .dta format is Stata; haven::read_dta()
# preserves value labels, avoiding the data loss of CSV exports.
data_2022 <- read_dta("data/00_CIUS2022.dta")
ds00 <- data_2022
dim(ds00)
#> [1] 25118   342
# Working copy: keep the raw read (ds00) untouched so re-coding steps can
# be re-run without re-reading the file.
ds0 <- ds00
Constructing Model Variables
- Renaming variables
- Cleaning data: delete the skips and such for both categorical and numerical variables
- Verifying that our measure of latent constructs are strong enough
Demographic Variables
Since these are not direct questions but information retrieved from other sources (such as postal codes for province), some of them do not have Skips/Don’t Know/Refusal/Not Stated answers. If they have, I have added those to the cards.
Province, Age, Sex, Education, Income:
Variable Name: PROVINCE
Concept: PROVINCE
Question Text/Note:
Information derived using postal codes.
Answer Categories | Code |
---|---|
Newfoundland and Labrador | 10 |
Prince Edward Island | 11 |
Nova Scotia | 12 |
New Brunswick | 13 |
Quebec | 24 |
Ontario | 35 |
Manitoba | 46 |
Saskatchewan | 47 |
Alberta | 48 |
British Columbia | 59 |
Valid skip | 96 |
Don’t know | 97 |
Refusal | 98 |
Not stated | 99 |
Variable Name: AGE_GRP
Concept: Age Groups - Derived variable
Question Text/Note:
Information derived from age of persons in household.
Answer Categories | Code |
---|---|
15 to 24 years | 01 |
25 to 34 years | 02 |
35 to 44 years | 03 |
45 to 54 years | 04 |
55 to 64 years | 05 |
65 years and over | 06 |
Valid skip | 96 |
Don’t know | 97 |
Refusal | 98 |
Not stated | 99 |
Variable Name: GENDER
Concept: Gender - Derived variable
Question Text/Note:
Refers to current gender which may be different from sex assigned at birth and may be different from what is indicated on legal documents. For data quality and confidentiality reasons, and because of the small population being measured, the dissemination of data according to ’Non binary’ Gender is not possible for this statistical program. So, this release uses a gender variable with only two categories. This variable is derived by looking at a large number of demographic characteristics from the respondent, it allows us to disseminate data on Gender that is reliable and unbiased.
Answer Categories | Code |
---|---|
Male | 1 |
Female | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: EMP
Concept: Employment status - Derived variable
Question Text/Note:
Answer Categories | Code |
---|---|
Employed | 1 |
Not employed | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: EDU
Concept: Highest certificate - Derived variable
Question Text/Note:
Answer Categories | Code |
---|---|
High school or less | 1 |
Some post-secondary (incl. univ certificate) | 2 |
University degree | 3 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: HINCQUIN
Concept: Census family income quintile - Derived variable
Question Text/Note:
Information derived using HINC. In order to obtain equal weighted counts in each category, cases with incomes equal to the category cutoffs were randomly assigned to one of the two categories on either side of the cutoff.
Source Annual Income Estimates for Census Families and Individuals (T1 Family File)
Answer Categories | Code |
---|---|
Quintile 1 - \leq $42,256 | 1 |
Quintile 2 - $42,257 - $72,366 | 2 |
Quintile 3 - $72,367 - $107,480 | 3 |
Quintile 4 - $107,481 - $163,750 | 4 |
Quintile 5 - > $163,750 | 5 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
# Recode the demographic PUMF variables into model variables.
# Convention used throughout: valid categories keep interpretable codes,
# while skip/don't-know/refusal/not-stated codes fall into a sentinel
# (-1 or 0, noted per variable) that is filtered out later.
ds0 <- ds0 %>% mutate(
  ID = as.factor(pumfid),
  # Province: Statistics Canada numeric codes -> two-letter abbreviations.
  PRVNC = case_when(
    province == 10 ~ "NL",
    province == 11 ~ "PEI",
    province == 12 ~ "NS",
    province == 13 ~ "NB",
    province == 24 ~ "QC",
    province == 35 ~ "ON",
    province == 46 ~ "MB",
    province == 47 ~ "SK",
    province == 48 ~ "AB",
    province == 59 ~ "BC",
    .default = "default"
  ),
  # Age groups 1-6 are valid; non-response codes (96-99) collapse to 0.
  AGE = ifelse(
    AGE_GRP > 10,
    0,
    AGE_GRP
  ),
  # Gender: male -> 0, female -> 1; non-response -> -1.
  SEX = case_when(
    gender == 1 ~ 0, # "M"
    gender == 2 ~ 1, # "F"
    .default = -1    # other / non-response
  ),
  # Employment: employed -> 1, not employed -> 0; non-response -> -1.
  EMP = case_when(
    emp == 1 ~ 1,
    emp == 2 ~ 0, # no
    .default = -1
  ),
  # Education: ordinal 1-3 kept as-is; non-response -> 0.
  EDU = case_when(
    edu == 1 ~ 1, # "High school"
    edu == 2 ~ 2, # "Some post-secondary"
    edu == 3 ~ 3, # "University"
    .default = 0  # non-response
  ),
  # Household income quintile: 1-5 kept as-is; non-response -> 0.
  INCOME = case_when(
    hincquin == 1 ~ 1, # "Q1"
    hincquin == 2 ~ 2, # "Q2"
    hincquin == 3 ~ 3, # "Q3"
    hincquin == 4 ~ 4, # "Q4"
    hincquin == 5 ~ 5, # "Q5"
    .default = 0       # non-response
  )
)
Other Model Variables
Devices and Mbanking:
Variable Name: DV_010A
Concept: Devices used
Question Text/Note:
During the past three months, what devices did you use to access the Internet? Did you use: A smartphone
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: DV_010G
Concept: Devices used
Question Text/Note:
During the past three months, what devices did you use to access the Internet? Did you use: Internet-connected wearable smart devices
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
CIUS’s Microdata User Guide has a section (section 4. Concepts and Definitions) but it does not include a definition for Internet-connected smart wearable devices. Searching the internet, some examples of these smart wearable devices are:
- Smart glasses
- Smart watch
- Fitness Trackers
- Smart Shirt
- GPS devices (SGPS/GPRS Body Control)
- Bluetooth Key Trackers
- Smart Belts
- Smart Rings
- Smart Bracelets
- Virtual Reality devices
- Smart clothing
More specific to Canada, according to Ingenium.ca, the top devices are:
- Smartwatches (Apple Watch, Samsung Galaxy Watch and Fitbits)
- Fitness Trackers (Fitbit, Garmin)
- Health Monitoring Devices (continuous glucose monitors)
Also true from CIUS 2020’s report:
In addition, 14% of Canadians used Internet-connected wearable smart devices, such as a smart watch, Fit Bit or glucose monitoring device
It’s safe to assume that smart wearables most definitely include smartwatches.
Variable Name: UI_050D
Concept: Activities related to other online activities
Question Text/Note:
During the past three months, which of the following other online activities, have you done over the Internet? Have you: Conducted online banking
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
# Recode device usage and m-banking indicators to 1 = yes, 0 = no,
# -1 = non-response (skip / don't know / refusal / not stated).
ds0 <- ds0 %>% mutate(
  # Used a smartphone to access the Internet in the past 3 months (DV_010A).
  SMRTPHN = case_when(
    DV_010A == 1 ~ 1,
    DV_010A == 2 ~ 0,
    .default = -1
  ),
  # Used an Internet-connected wearable smart device (DV_010G).
  SMRTWTCH = case_when(
    DV_010G == 1 ~ 1,
    DV_010G == 2 ~ 0,
    .default = -1
  ),
  # Conducted online banking in the past 3 months (UI_050D).
  MBANK = case_when(
    UI_050D == 1 ~ 1,
    UI_050D == 2 ~ 0,
    .default = -1
  )
)
Time Saving Effects
Variable Name: UI_110E
Concept: Effects of the use of online activities
Question Text/Note:
During the past 12 months, did your use of online activities have any of the following effects? Did it: Save you time
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
# Perceived value proxy: online activities saved the respondent time
# (UI_110E). 1 = yes, 0 = no, -1 = non-response.
ds0 <- ds0 %>% mutate(
  EFF_TIME = case_when(
    UI_110E == 1 ~ 1,
    UI_110E == 2 ~ 0,
    .default = -1
  )
)
And since the online activity I’m considering is mobile banking, this would be about mobile banking (more on this in my paper, as it’s not entirely true - this is a limitation).
Security
Variable Name: SP_010A
Concept: Activities carried out to manage access to personal data
Question Text/Note:
Have you carried out any of the following to manage access to your personal data over the Internet during the past 12 months? Have you: Restricted or refused access to your geographical location
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_010B
Concept: Activities carried out to manage access to personal data
Question Text/Note:
Have you carried out any of the following to manage access to your personal data over the Internet during the past 12 months? Have you: Refused allowing the use of personal data for advertising purposes
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_010C
Concept: Activities carried out to manage access to personal data
Question Text/Note:
Have you carried out any of the following to manage access to your personal data over the Internet during the past 12 months? Have you: Checked that the website where you provided personal data was secure
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_010D
Concept: Activities carried out to manage access to personal data
Question Text/Note:
Have you carried out any of the following to manage access to your personal data over the Internet during the past 12 months? Have you: Changed the privacy settings on accounts or apps
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Security Measures - Setting Up
Variable Name: SP_020A
Concept: Verified identity over the Internet
Question Text/Note:
During the past 12 months, did you enable any of the following optional security features to verify your identity when accessing accounts or applications over the Internet? Did you enable: Answers to personalized security questions
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_020B
Concept: Verified identity over the Internet
Question Text/Note:
During the past 12 months, did you enable any of the following optional security features to verify your identity when accessing accounts or applications over the Internet? Did you enable: Partner login
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_020C
Concept: Verified identity over the Internet
Question Text/Note:
During the past 12 months, did you enable any of the following optional security features to verify your identity when accessing accounts or applications over the Internet? Did you enable: Two-factor authentication or two-step verification
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_020D
Concept: Verified identity over the Internet
Question Text/Note:
During the past 12 months, did you enable any of the following optional security features to verify your identity when accessing accounts or applications over the Internet? Did you enable: Biometric security features for online functions
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
Variable Name: SP_020E
Concept: Verified identity over the Internet
Question Text/Note:
During the past 12 months, did you enable any of the following optional security features to verify your identity when accessing accounts or applications over the Internet? Did you enable: Password manager program
Answer Categories | Code |
---|---|
Yes | 1 |
No | 2 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
# All nine security items share the same yes/no coding, so use one helper:
# 1 = yes, 0 = no, -1 = non-response (skip / don't know / refusal / not stated).
recode_yn <- function(x) {
  case_when(
    x == 1 ~ 1,
    x == 2 ~ 0,
    .default = -1
  )
}

# Observable indicators for the latent Perceived Security construct (PSEC):
# SP_010A-D are access-management behaviours, SP_020A-E are optional
# security features the respondent enabled.
ds0 <- ds0 %>% mutate(
  SEC_RES_LOC     = recode_yn(SP_010A), # restricted access to location
  SEC_RES_DAT     = recode_yn(SP_010B), # refused use of personal data for ads
  SEC_ACC_WEBSEC  = recode_yn(SP_010C), # checked a website's security
  SEC_ACC_CHNGPRV = recode_yn(SP_010D), # changed privacy settings
  SECOPT_QS       = recode_yn(SP_020A), # personalized security questions
  SECOPT_PL       = recode_yn(SP_020B), # partner login
  SECOPT_2FA      = recode_yn(SP_020C), # two-factor authentication
  SECOPT_BIO      = recode_yn(SP_020D), # biometric features
  SECOPT_PAS      = recode_yn(SP_020E)  # password manager
)
Trust In Banks/Financial Institutes
Variable Name: SP_040B
Concept: Personal information - Trust in organizations
Question Text/Note:
In general, on a scale from 1 to 5 where 1 means “cannot be trusted at all” and 5 means “can be trusted completely”, to what extent do you trust the following organizations with your personal information? Would you say: b. Banking or other financial institutions
Answer Categories | Code |
---|---|
1 - Cannot be trusted at all | 1 |
2 | 2 |
3 - Neutral | 3 |
4 | 4 |
5 - Can be trusted completely | 5 |
Valid skip | 6 |
Don’t know | 7 |
Refusal | 8 |
Not stated | 9 |
# Trust in banks/financial institutions (SP_040B), 5-point Likert scale.
# Valid 1-5 kept as-is; non-response codes (6-9) -> 0, filtered out later.
ds0 <- ds0 %>% mutate(
  TRST_BANK = case_when(
    SP_040B == 1 ~ 1, # cannot be trusted at all
    SP_040B == 2 ~ 2,
    SP_040B == 3 ~ 3, # neutral
    SP_040B == 4 ~ 4,
    SP_040B == 5 ~ 5, # can be trusted completely
    .default = 0      # non-response
  )
)
Selecting only the useful columns:
<- ds0 %>% dplyr::select(wtpg:TRST_BANK) ds_useful
I have to make sure that online banking is actually capturing mobile banking, so, people must be smartphone users:
# Online banking only counts as *mobile* banking for smartphone users,
# so restrict the sample to respondents who used a smartphone.
ds_mobilebank <- ds_useful %>% filter(SMRTPHN == 1)
dim(ds_mobilebank)
#> [1] 20136    22
Looking at the data,
glimpse(ds_mobilebank)
Rows: 20,136
Columns: 22
$ wtpg <dbl> 1264.0837, 3413.0468, 585.5445, 378.8767, 4060.0513, 7…
$ ID <fct> 100001, 100002, 100004, 100005, 100007, 100008, 100009…
$ PRVNC <chr> "QC", "MB", "QC", "SK", "QC", "QC", "AB", "ON", "SK", …
$ AGE <dbl> 3, 1, 5, 4, 2, 4, 6, 3, 3, 6, 2, 3, 4, 5, 4, 5, 5, 4, …
$ SEX <dbl> 1, 0, 0, 0, 1, 0, 0, 1, 1, 0, 1, 1, 1, 0, 1, 0, 0, 0, …
$ EMP <dbl> 1, 0, 0, 1, 1, 1, 0, 1, -1, 0, 1, 1, 1, -1, 1, 1, 0, 1…
$ EDU <dbl> 3, 1, 2, 1, 2, 2, 2, 2, 0, 3, 3, 3, 3, 2, 1, 3, 3, 2, …
$ INCOME <dbl> 2, 2, 4, 2, 2, 5, 5, 5, 5, 4, 3, 4, 3, 3, 1, 4, 2, 4, …
$ SMRTPHN <dbl> 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, …
$ SMRTWTCH <dbl> 0, 0, 0, 0, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, …
$ MBANK <dbl> 1, 0, 1, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, …
$ EFF_TIME <dbl> 0, 0, 1, 0, 0, 1, 1, 0, 1, 1, 1, 0, 1, 1, 0, 1, 0, 1, …
$ SEC_RES_LOC <dbl> 1, 0, 1, 0, 0, 0, 0, 1, -1, 0, 1, 1, 1, 0, 1, 1, 0, 0,…
$ SEC_RES_DAT <dbl> 1, 0, 1, 0, 0, 0, 1, 0, -1, 0, 1, 1, 1, 0, 1, 1, 0, 0,…
$ SEC_ACC_WEBSEC <dbl> 1, 0, 0, 0, 1, 0, 1, 0, -1, 0, 0, 0, 0, 0, 1, 1, 0, 0,…
$ SEC_ACC_CHNGPRV <dbl> 1, 0, 1, 0, 0, 0, 1, 0, -1, 0, 1, 0, 1, 0, 1, 1, 0, 0,…
$ SECOPT_QS <dbl> 1, 0, 1, 0, 0, 0, 1, 1, -1, 1, 1, 0, 0, 1, 1, 1, 0, 1,…
$ SECOPT_PL <dbl> 1, 0, 0, 0, 0, 0, 0, 0, -1, 0, 0, 0, 1, 0, 1, 0, 0, 1,…
$ SECOPT_2FA <dbl> 1, 0, 1, 0, 0, 0, 0, 1, -1, 1, 1, 1, 1, 1, 1, 1, 0, 1,…
$ SECOPT_BIO <dbl> 0, 0, 0, 0, 1, 0, 1, 0, -1, 0, 1, 1, 0, 0, 1, 1, 0, 1,…
$ SECOPT_PAS <dbl> 1, 0, 0, 0, 0, 0, 0, 0, -1, 0, 1, 1, 0, 0, 1, 1, 0, 0,…
$ TRST_BANK <dbl> 3, 3, 3, 3, 3, 4, 3, 3, 0, 5, 4, 4, 3, 3, 3, 4, 4, 5, …
I see there are some -1 values and some values that don’t make sense. Drop these. The data is large enough such that 100 rows won’t affect the analysis.
# Drop rows with sentinel codes from the recoding step: -1 marks
# non-response everywhere, and 0 marks non-response for EDU and TRST_BANK
# (whose valid codes start at 1). The sample is large enough that losing
# these rows does not affect the analysis.
cleaned_ds <- ds_mobilebank %>% filter(
  !is.na(AGE) &
    SEX != -1 &
    EDU != -1 &
    EDU != 0 &
    INCOME != -1 &
    SMRTWTCH != -1 &
    MBANK != -1 &
    EFF_TIME != -1 &
    SEC_RES_LOC != -1 &
    SEC_RES_DAT != -1 &
    SEC_ACC_WEBSEC != -1 &
    SEC_ACC_CHNGPRV != -1 &
    SECOPT_QS != -1 &
    SECOPT_PL != -1 &
    SECOPT_2FA != -1 &
    SECOPT_BIO != -1 &
    SECOPT_PAS != -1 &
    TRST_BANK != -1 &
    TRST_BANK != 0
)
dim(cleaned_ds)
#> [1] 18552    22
Saving this in a database called wrk_ds
(working database):
<- cleaned_ds wrk_ds
Perceived Security Measurement
Since there is no perceived security measure I need to define it as a latent variable. In the CIUS2022, there’s no single, direct question asking:
How secure do you think mobile banking is?
That means “Perceived Security” isn’t directly measured. However, we have multiple related questions (e.g., about data protection, privacy, security measures, etc.). These indirect items are observable variables that reflect an unobservable (latent) concept: the user’s overall perception of security.
I use Confirmatory Factor Analysis (CFA) to test if these items really reflect one underlying factor, i.e., PSEC
. This increases measurement reliability. Here are the steps of CFA:
Step 1. Reliability Check
Using Cronbach’s Alpha, I check the internal consistency of the items. Usually, if \alpha > 0.7, the items are measuring the same idea.
# Step 1: internal consistency of the nine security items.
# Raw Cronbach's alpha > 0.7 indicates the items measure the same construct.
alph <- psych::alpha(wrk_ds[, c("SEC_RES_LOC", "SEC_RES_DAT", "SEC_ACC_WEBSEC",
                                "SEC_ACC_CHNGPRV", "SECOPT_QS", "SECOPT_PL",
                                "SECOPT_2FA", "SECOPT_BIO", "SECOPT_PAS")])
alph$total$raw_alpha
#> [1] 0.8103267
Which means the measurement is strong.
Step 2. Exploratory Factor Check
With Parallel Analysis, I can decide how many factors to extract.
# Step 2: parallel analysis to suggest how many factors underlie the items
# (fa = "fa" shows factor-analysis eigenvalues only, not PCA components).
fa.parallel(wrk_ds[, c("SEC_RES_LOC", "SEC_RES_DAT", "SEC_ACC_WEBSEC", "SEC_ACC_CHNGPRV",
"SECOPT_QS", "SECOPT_PL", "SECOPT_2FA", "SECOPT_BIO", "SECOPT_PAS")],
fa = "fa")
Note: the eigenvalue on the y-axis is basically the variance each factor explains
From a parallel scree plot, you should look for an “elbow” - this is where the curve changes direction, which looks like 2 here. Parallel analysis suggests 2 separate factors. That is, it could be that the factors I’m looking at are actually describing two different latent ideas:
- Maybe security measures like questions, 2FA, biometrics
- and security concerns overall, like not allowing access to location or data
However, I still think they’re basically both perceptions of security of an app. You only do these things if you think the app is not secure or you’re worried about security. So, I’ll keep these factors and decide if something should be combined or dropped from CFA results.
Step 3. Confirmatory Factor Analysis
The mathematical model is
X_i = \lambda_i PSEC_i + \epsilon_i
Where X_i’s are the observed variables, \lambda_i is the factor loading of variable i and \epsilon_i is the error term. This is as if asking “can all 9 items be explained by 1 variable (PSEC
)?”. I’m adding covariances to improve the model fit. How did I decide which covariances (this notation \sim\sim) to add? Trial and error, tbh! But also, mostly intuitive. The questions about security measures like setting up 2FA, questions, biometrics, password managers and partner login are all kind of related (to the same idea of “setting up and using security measures/features”). They have the same context on CIUS questionnaires, too. Similarly, the data and location access are kind of about the same idea: they’re both restrictions on what is shared about you. Lastly, changing privacy and checking a website’s security are both activities that are not part of security features. There’s no set up, and all websites and apps have to allow you to be able to change privacy settings and you can check any website.
# Step 3: one-factor CFA. All nine items load on a single latent factor f
# (the Perceived Security construct, PSEC); residual covariances are added
# between conceptually related item pairs (chosen via modindices()).
f1 <- '
f =~ SEC_RES_LOC + SEC_RES_DAT + SEC_ACC_WEBSEC + SEC_ACC_CHNGPRV + SECOPT_QS + SECOPT_PL + SECOPT_2FA + SECOPT_BIO + SECOPT_PAS

# Adding covariances between related error terms (based on modindices)
SEC_RES_LOC ~~ SEC_RES_DAT
SECOPT_QS ~~ SECOPT_2FA
SECOPT_BIO ~~ SECOPT_PAS
SEC_ACC_WEBSEC ~~ SEC_ACC_CHNGPRV
SECOPT_QS ~~ SECOPT_PL
'

# std.lv = TRUE fixes the latent variance to 1 so every loading is estimated.
compatibility_fac <- cfa(f1, data = wrk_ds, std.lv = TRUE)
summary(compatibility_fac, fit.measures = TRUE, standardized = TRUE)
lavaan 0.6.16 ended normally after 43 iterations
Estimator ML
Optimization method NLMINB
Number of model parameters 23
Number of observations 18552
Model Test User Model:
Test statistic 1328.000
Degrees of freedom 22
P-value (Chi-square) 0.000
Model Test Baseline Model:
Test statistic 42055.218
Degrees of freedom 36
P-value 0.000
User Model versus Baseline Model:
Comparative Fit Index (CFI) 0.969
Tucker-Lewis Index (TLI) 0.949
Loglikelihood and Information Criteria:
Loglikelihood user model (H0) -93888.613
Loglikelihood unrestricted model (H1) -93224.613
Akaike (AIC) 187823.227
Bayesian (BIC) 188003.278
Sample-size adjusted Bayesian (SABIC) 187930.185
Root Mean Square Error of Approximation:
RMSEA 0.057
90 Percent confidence interval - lower 0.054
90 Percent confidence interval - upper 0.059
P-value H_0: RMSEA <= 0.050 0.000
P-value H_0: RMSEA >= 0.080 0.000
Standardized Root Mean Square Residual:
SRMR 0.034
Parameter Estimates:
Standard errors Standard
Information Expected
Information saturated (h1) model Structured
Latent Variables:
Estimate Std.Err z-value P(>|z|) Std.lv Std.all
f =~
SEC_RES_LOC 0.274 0.004 74.861 0.000 0.274 0.581
SEC_RES_DAT 0.300 0.004 82.428 0.000 0.300 0.627
SEC_ACC_WEBSEC 0.281 0.004 70.897 0.000 0.281 0.564
SEC_ACC_CHNGPR 0.336 0.004 88.679 0.000 0.336 0.673
SECOPT_QS 0.258 0.004 65.258 0.000 0.258 0.523
SECOPT_PL 0.219 0.004 58.760 0.000 0.219 0.466
SECOPT_2FA 0.275 0.003 81.371 0.000 0.275 0.620
SECOPT_BIO 0.221 0.004 58.219 0.000 0.221 0.462
SECOPT_PAS 0.217 0.004 55.881 0.000 0.217 0.446
Covariances:
Estimate Std.Err z-value P(>|z|) Std.lv Std.all
.SEC_RES_LOC ~~
.SEC_RES_DAT 0.059 0.001 40.627 0.000 0.059 0.414
.SECOPT_QS ~~
.SECOPT_2FA 0.026 0.001 19.955 0.000 0.026 0.181
.SECOPT_BIO ~~
.SECOPT_PAS 0.033 0.002 21.891 0.000 0.033 0.180
.SEC_ACC_WEBSEC ~~
.SEC_ACC_CHNGPR 0.020 0.002 13.119 0.000 0.020 0.131
.SECOPT_QS ~~
.SECOPT_PL 0.021 0.001 14.773 0.000 0.021 0.119
Variances:
Estimate Std.Err z-value P(>|z|) Std.lv Std.all
.SEC_RES_LOC 0.147 0.002 80.302 0.000 0.147 0.663
.SEC_RES_DAT 0.139 0.002 76.666 0.000 0.139 0.607
.SEC_ACC_WEBSEC 0.169 0.002 79.760 0.000 0.169 0.682
.SEC_ACC_CHNGPR 0.137 0.002 70.015 0.000 0.137 0.547
.SECOPT_QS 0.176 0.002 83.349 0.000 0.176 0.726
.SECOPT_PL 0.173 0.002 88.126 0.000 0.173 0.783
.SECOPT_2FA 0.121 0.002 77.426 0.000 0.121 0.615
.SECOPT_BIO 0.180 0.002 88.309 0.000 0.180 0.787
.SECOPT_PAS 0.190 0.002 88.964 0.000 0.190 0.802
f 1.000 1.000 1.000
For CFA, this is how you’d interpret the results:
Metric | Desired Threshold | Description |
---|---|---|
CFI (Comparative Fit Index) | > 0.90 | Higher is better fit |
TLI (Tucker-Lewis Index) | > 0.90 | Higher is better fit |
RMSEA (Root Mean Square Error of Approx.) | < 0.08 or ideally < 0.05 | Lower is better (error) |
SRMR (Standardized Root Mean Residual) | < 0.08 | Lower is better |
The more of these “checkboxes” your summary output checks, the better the fit for the factor. Factor loadings are about how strongly each factor represents/reflects the latent factor. The higher the values the better (and significant).
My results are excellent for now, but, it’s also good to check for residuals and correlations:
Step 4. Are Items Correlated?
modindices(compatibility_fac, sort = TRUE, minimum.value = 10)
lhs op rhs mi epc sepc.lv sepc.all sepc.nox
53 SECOPT_PL ~~ SECOPT_PAS 169.942 0.018 0.018 0.100 0.100
54 SECOPT_2FA ~~ SECOPT_BIO 128.215 0.014 0.014 0.092 0.092
51 SECOPT_PL ~~ SECOPT_2FA 115.978 0.014 0.014 0.096 0.096
32 SEC_RES_DAT ~~ SEC_ACC_WEBSEC 111.173 0.012 0.012 0.081 0.081
52 SECOPT_PL ~~ SECOPT_BIO 92.814 0.013 0.013 0.074 0.074
46 SEC_ACC_CHNGPRV ~~ SECOPT_2FA 86.313 -0.012 -0.012 -0.090 -0.090
33 SEC_RES_DAT ~~ SEC_ACC_CHNGPRV 82.720 0.011 0.011 0.077 0.077
26 SEC_RES_LOC ~~ SEC_ACC_CHNGPRV 63.660 0.009 0.009 0.064 0.064
28 SEC_RES_LOC ~~ SECOPT_PL 58.695 -0.009 -0.009 -0.055 -0.055
45 SEC_ACC_CHNGPRV ~~ SECOPT_PL 56.901 -0.010 -0.010 -0.066 -0.066
30 SEC_RES_LOC ~~ SECOPT_BIO 46.796 -0.008 -0.008 -0.048 -0.048
41 SEC_ACC_WEBSEC ~~ SECOPT_2FA 38.361 -0.008 -0.008 -0.054 -0.054
35 SEC_RES_DAT ~~ SECOPT_PL 35.212 -0.007 -0.007 -0.044 -0.044
55 SECOPT_2FA ~~ SECOPT_PAS 32.752 0.007 0.007 0.046 0.046
36 SEC_RES_DAT ~~ SECOPT_2FA 30.066 -0.006 -0.006 -0.043 -0.043
31 SEC_RES_LOC ~~ SECOPT_PAS 25.153 -0.006 -0.006 -0.035 -0.035
38 SEC_RES_DAT ~~ SECOPT_PAS 23.792 -0.006 -0.006 -0.035 -0.035
42 SEC_ACC_WEBSEC ~~ SECOPT_BIO 22.575 -0.007 -0.007 -0.038 -0.038
43 SEC_ACC_WEBSEC ~~ SECOPT_PAS 19.748 -0.006 -0.006 -0.035 -0.035
37 SEC_RES_DAT ~~ SECOPT_BIO 18.991 -0.005 -0.005 -0.032 -0.032
34 SEC_RES_DAT ~~ SECOPT_QS 12.791 -0.004 -0.004 -0.026 -0.026
50 SECOPT_QS ~~ SECOPT_PAS 11.720 0.005 0.005 0.026 0.026
A few modification indices are quite large, indicating these variables are highly correlated. So, it’s a good idea to add a covariance for them to the model. However, adding too much risks complicating the model and potentially overfitting. Since my analysis so far shows strong fit (above 90% CFI), I stop here. However, there’s room for improvement!
Step 5. Check Sample-sized control RMSEA
Since my sample size is quite large, the RMSEA is affected. In fact, all the metrics (especially \chi^2) are. So, I’ll calculate the sample-sized normalized RMSEA:
fitmeasures(compatibility_fac, "rmsea") / sqrt(18552)
rmsea
0
And check the reliability of the model:
# reliability(compatibility_fac) --- 82%
Step 6. Subgroup Analysis
Before we define PSEC
, I have to figure out how many different categories I want. The smallest would be 2: high and low. I want to check for unobserved subgroups/subclasses. This is called Latent Class Analysis (LCA). Since the values for the items had 0’s in them, I have to add 1’s to everything (required for poLCA
).
# Step 6: Latent Class Analysis. poLCA requires outcome codes to start at 1,
# so shift the 0/1 items to 1/2 (run once: re-running shifts them again).
sec_items <- c("SEC_RES_LOC", "SEC_RES_DAT", "SEC_ACC_WEBSEC", "SEC_ACC_CHNGPRV",
               "SECOPT_QS", "SECOPT_PL", "SECOPT_2FA", "SECOPT_BIO", "SECOPT_PAS")
wrk_ds[, sec_items] <- wrk_ds[, sec_items] + 1

# Intercept-only LCA formula over the nine security items.
form <- cbind(SEC_RES_LOC, SEC_RES_DAT, SEC_ACC_WEBSEC, SEC_ACC_CHNGPRV,
              SECOPT_QS, SECOPT_PL, SECOPT_2FA, SECOPT_BIO, SECOPT_PAS) ~ 1

# Run LCA with 2 latent classes
lca_model <- poLCA(form, data = wrk_ds, nclass = 2, maxiter = 5000)
Conditional item response (column) probabilities,
by outcome variable, for each class (row)
$SEC_RES_LOC
Pr(1) Pr(2)
class 1: 0.6822 0.3178
class 2: 0.1056 0.8944
$SEC_RES_DAT
Pr(1) Pr(2)
class 1: 0.7381 0.2619
class 2: 0.1067 0.8933
$SEC_ACC_WEBSEC
Pr(1) Pr(2)
class 1: 0.8797 0.1203
class 2: 0.3350 0.6650
$SEC_ACC_CHNGPRV
Pr(1) Pr(2)
class 1: 0.8775 0.1225
class 2: 0.2212 0.7788
$SECOPT_QS
Pr(1) Pr(2)
class 1: 0.7263 0.2737
class 2: 0.2124 0.7876
$SECOPT_PL
Pr(1) Pr(2)
class 1: 0.9084 0.0916
class 2: 0.5167 0.4833
$SECOPT_2FA
Pr(1) Pr(2)
class 1: 0.5936 0.4064
class 2: 0.0585 0.9415
$SECOPT_BIO
Pr(1) Pr(2)
class 1: 0.8906 0.1094
class 2: 0.4840 0.5160
$SECOPT_PAS
Pr(1) Pr(2)
class 1: 0.8515 0.1485
class 2: 0.4569 0.5431
Estimated class population shares
0.3951 0.6049
Predicted class memberships (by modal posterior prob.)
0.3843 0.6157
=========================================================
Fit for 2 latent classes:
=========================================================
number of observations: 18552
number of estimated parameters: 19
residual degrees of freedom: 492
maximum log-likelihood: -93737.84
AIC(2): 187513.7
BIC(2): 187662.4
G^2(2): 9189.15 (Likelihood ratio/deviance statistic)
X^2(2): 17134.16 (Chi-square goodness of fit)
# Component overview of the poLCA fit object (lengths/classes of each slot,
# not model-fit statistics — those are in the printed poLCA output above).
summary(lca_model)
Length Class Mode
llik 1 -none- numeric
attempts 1 -none- numeric
probs.start 9 -none- list
probs 9 -none- list
probs.se 9 -none- list
P.se 2 -none- numeric
posterior 37104 -none- numeric
predclass 18552 -none- numeric
P 2 -none- numeric
numiter 1 -none- numeric
probs.start.ok 1 -none- logical
coeff 1 -none- logical
coeff.se 1 -none- logical
coeff.V 1 -none- logical
eflag 1 -none- logical
npar 1 -none- numeric
aic 1 -none- numeric
bic 1 -none- numeric
Nobs 1 -none- numeric
Chisq 1 -none- numeric
predcell 11 data.frame list
Gsq 1 -none- numeric
y 9 data.frame list
x 1 data.frame list
N 1 -none- numeric
maxiter 1 -none- numeric
resid.df 1 -none- numeric
time 1 difftime numeric
call 5 -none- call
Essentially, I’m grouping people into classes based on their security attitudes using the 9 items that I have. Let’s check the 2 classes results. How to interpret the results:
- First, Pr(1) is probability the person in class x picks 1 for an item (1 here is the previous 0). So:
$SEC_RES_LOC
Pr(1), class 1: 0.68
Pr(2), class 2: 0.11
Means people in class 1 are more likely to not restrict their location (SEC_RES_LOC = 0
has a higher chance in class 1). These can simply be people with high and low security tolerance. The Estimated class population shares for the classes are very close to the prediction of my model, i.e., the Predicted class memberships (by modal posterior prob.). This means these are very good estimates. The fit statistics show: AIC, BIC, deviance, and \chi^2.
Now let’s try for 2,3 and 4 classes and pick the best - using AIC/BIC (lower = better fit):
# Fit LCA with 2, 3, and 4 classes; capture.output() silences poLCA's
# verbose console printout, invisible() hides the captured text itself.
invisible(capture.output({lca_2 <- poLCA(form, data = wrk_ds, nclass = 2)}))
invisible(capture.output({lca_3 <- poLCA(form, data = wrk_ds, nclass = 3)}))
invisible(capture.output({lca_4 <- poLCA(form, data = wrk_ds, nclass = 4)}))
# Compare AIC and BIC (lower = better fit).
# NOTE(review): poLCA uses random starting values, so these fits can vary
# slightly between runs unless a seed is set upstream — confirm.
data.frame(
Classes = 2:4,
AIC = c(lca_2$aic, lca_3$aic, lca_4$aic),
BIC = c(lca_2$bic, lca_3$bic, lca_4$bic)
)
Classes AIC BIC
1 2 187513.7 187662.4
2 3 183851.6 184078.7
3 4 180666.1 180971.4
This is telling me PSEC should have 4 levels.
Step 7. Calculate Perceived Security
Another way to define PSEC
is to use Item Response Theory (1 parameter logistic model, Rasch Model). This will give a continuous value for PSEC
, which is preferable for my analysis. Each item shares the same slope and I estimate
P(Y_{ij} = 1) = \frac{1}{1 + e^{-(PSEC_j - b_i)}}
Where PSEC_j is person j’s perceived security and b_i is item i’s threshold. The idea of b_i is: how hard is it for someone to agree with something. So, in my example, how high/low does the person’s security perception have to be to skip setting up 2FA/set up 2FA.
Note: IF PSEC = b, then the person has a 50-50 chance of agreeing.
Bottom line:
- If an item has high threshold, only those with high perceived security agree with it.
- If an item has a low threshold, most people agree with it.
So, let’s define PSEC
:
# Run a 1-parameter logistic model (Rasch Model): one latent factor, common
# slope across the nine binary security items.
irt_model <- mirt(wrk_ds[, c("SEC_RES_LOC", "SEC_RES_DAT", "SEC_ACC_WEBSEC",
                             "SEC_ACC_CHNGPRV", "SECOPT_QS", "SECOPT_PL",
                             "SECOPT_2FA", "SECOPT_BIO", "SECOPT_PAS")],
                  1, itemtype = "Rasch")
Iteration: 1, Log-Lik: -94814.100, Max-Change: 0.40153
Iteration: 2, Log-Lik: -93680.902, Max-Change: 0.41232
Iteration: 3, Log-Lik: -93047.348, Max-Change: 0.36846
Iteration: 4, Log-Lik: -92734.044, Max-Change: 0.29365
Iteration: 5, Log-Lik: -92591.472, Max-Change: 0.21441
Iteration: 6, Log-Lik: -92529.943, Max-Change: 0.14683
Iteration: 7, Log-Lik: -92504.107, Max-Change: 0.09617
Iteration: 8, Log-Lik: -92493.300, Max-Change: 0.06119
Iteration: 9, Log-Lik: -92488.690, Max-Change: 0.03822
Iteration: 10, Log-Lik: -92486.647, Max-Change: 0.02357
Iteration: 11, Log-Lik: -92485.693, Max-Change: 0.01445
Iteration: 12, Log-Lik: -92485.221, Max-Change: 0.00882
Iteration: 13, Log-Lik: -92484.970, Max-Change: 0.00567
Iteration: 14, Log-Lik: -92484.829, Max-Change: 0.00316
Iteration: 15, Log-Lik: -92484.756, Max-Change: 0.00192
Iteration: 16, Log-Lik: -92484.712, Max-Change: 0.00125
Iteration: 17, Log-Lik: -92484.685, Max-Change: 0.00070
Iteration: 18, Log-Lik: -92484.670, Max-Change: 0.00040
Iteration: 19, Log-Lik: -92484.661, Max-Change: 0.00028
Iteration: 20, Log-Lik: -92484.655, Max-Change: 0.00015
Iteration: 21, Log-Lik: -92484.652, Max-Change: 0.00009
# Extract person-level latent trait scores (PSEC) from the Rasch model.
wrk_ds$PSEC <- fscores(irt_model)
This is a 1-factor model (PSEC
). We find the factor scores (fscores()
), otherwise known as latent trait scores, which estimate each person’s position on the latent construct (basically, each person’s PSEC
score). Scores are usually centered around 0, and the more negative values are lower scores.
# Item parameters: a1 = common Rasch slope (fixed at 1), d = item intercept
# (easiness), g/u = lower/upper asymptotes (0 and 1 for this model).
coef(irt_model, simplify=TRUE)
$items
a1 d g u
SEC_RES_LOC 1 1.022 0 1
SEC_RES_DAT 1 0.867 0 1
SEC_ACC_WEBSEC 1 -0.348 0 1
SEC_ACC_CHNGPRV 1 0.078 0 1
SECOPT_QS 1 0.483 0 1
SECOPT_PL 1 -1.113 0 1
SECOPT_2FA 1 1.483 0 1
SECOPT_BIO 1 -0.937 0 1
SECOPT_PAS 1 -0.735 0 1
$means
F1
0
$cov
F1
F1 3.093
Summary statistics of PSEC
values:
# Distribution of the IRT-based person scores (centered near 0 by design).
summary(wrk_ds$PSEC)
F1
Min. :-2.8540382
1st Qu.:-1.3479926
Median : 0.1676207
Mean : 0.0009833
3rd Qu.: 1.2059331
Max. : 2.7349158
Similar to this, but much easier - since I already used lavaan
to do CFA, you can just use the same package to calculate the PSEC
values for users. This is what I will use for the paper:
# Overwrite PSEC with factor scores from the lavaan CFA model instead of the
# IRT scores; this is the version used in the paper.
wrk_ds$PSEC <- lavPredict(compatibility_fac)
summary(wrk_ds$PSEC)
f
Min. :-1.5300
1st Qu.:-0.7069
Median : 0.1029
Mean : 0.0000
3rd Qu.: 0.7779
Max. : 1.3023
Build Datasets for Modeling
So, I need 3 different datasets for modeling:
- Full Data: includes everything
- PHON-only Data: only smartphone users
- WEAR-only Data: only smartwear users
Let’s separate the data:
# 0/1 indicator columns for device ownership, derived from the raw survey
# codes (SMRTPHN / SMRTWTCH == 1 means the respondent owns the device).
wrk_ds <- wrk_ds %>% mutate(
  isSmartPhone = if_else(SMRTPHN == 1, 1, 0),
  isSmartWear = if_else(SMRTWTCH == 1, 1, 0)
)
Some Visualizations
# Histogram of the CFA-based PSEC factor scores across all respondents.
ggplot(wrk_ds, aes(PSEC)) +
  geom_histogram(
    binwidth = 0.5,
    fill = "steelblue",
    color = "black",
    alpha = 0.7
  ) +
  theme_minimal() +
  labs(
    title = "Distribution of Perceived Security (PSEC) Scores",
    x = "PSEC Score",
    y = "Number of Users"
  )
# Province-level view of m-banking probability by wearable ownership.
p1 <- ggplot(wrk_ds, aes(x = as.factor(isSmartWear), y = MBANK, color = as.factor(PRVNC))) +
  stat_summary(fun.data = "mean_cl_boot", geom = "line", aes(group = as.factor(PRVNC))) +
  labs(x = "Wearable User", y = "Probability of M-banking", color = "Province") +
  scale_color_brewer(palette = "Paired") +
  theme_minimal()

p2 <- ggplot(wrk_ds, aes(as.factor(PRVNC), MBANK, color = as.factor(isSmartWear))) +
  stat_summary(fun = mean, geom = "point") +
  stat_summary(fun.data = mean_cl_boot, geom = "errorbar", width = 0.4) +
  # FIX: the original chained theme_set(theme_bw(base_size = 10)) into the
  # plot — theme_set() mutates the GLOBAL theme and returns the old one, so
  # it does not belong in a `+` chain. Add the theme to this plot instead.
  theme_bw(base_size = 10) +
  theme(legend.position = "top") +
  labs(x = "Province", y = "Observed Probability of mobile banking", color = "Wearable User") +
  theme_minimal()

ggarrange(p1, p2, ncol = 2)
It doesn’t really look like there’s much grouping happening here! Let’s build the datasets:
# Normalize the survey person-weight to (0, 1] so it can be passed as a
# glm()/glmer() prior weight without blowing up the working residuals.
wrk_ds <- wrk_ds %>% mutate(
  scaled_wtpg = wtpg / max(wtpg)
)
# Build the modeling dataset with factor / centered variants of predictors.
# Naming convention:
#   "_f" suffix  -> factor version,
#   "_c" suffix  -> mean-centered version,
#   no suffix    -> raw integer (usually 0/1).
wrk_ds_fulldata <- wrk_ds %>% mutate(
  PRVNC_f = as.factor(PRVNC),
  SEX_f = factor(SEX, levels = c("0", "1")),
  EDU_f = factor(EDU, levels = c("1", "2", "3")),
  EDU_c = EDU - mean(EDU),
  AGE_f = as.factor(AGE),
  AGE_f = relevel(AGE_f, ref = "2"),    # age group 2 is the reference level
  AGE_c = AGE - mean(AGE),
  INCOME_f = as.factor(INCOME),
  INCOME_c = INCOME - mean(INCOME),
  EFF_TIME_f = as.factor(EFF_TIME),
  # Added variables: shifted / centered / 1-8 rescaled versions of PSEC.
  PSEC_c1 = PSEC - min(PSEC) + 1,
  PSEC_c = PSEC - mean(PSEC),
  PSEC_scaled = 1 + 7 * ((PSEC - min(PSEC)) / (max(PSEC) - min(PSEC))),
  TRST_BANK_f = as.factor(TRST_BANK),
  TRST_BANK_f = relevel(TRST_BANK_f, ref = "5"),  # highest trust = reference
  TRST_BANK_c = TRST_BANK - mean(TRST_BANK),
  USR_TYP = case_when(
    isSmartWear == 1 ~ "WEAR",
    isSmartWear == 0 ~ "PHON",
    .default = "OTHER"
  )
)
# Split into the two comparison samples: phone-only vs. wearable users.
wrk_ds_fulldata_PHON <- wrk_ds_fulldata %>% filter(isSmartWear == 0)
wrk_ds_fulldata_WEAR <- wrk_ds_fulldata %>% filter(isSmartWear == 1)
Let’s see the counts for the groups:
# Cross-tabulate m-banking adoption (rows) by user type (columns).
ctab <- table(wrk_ds_fulldata$MBANK, wrk_ds_fulldata$USR_TYP)
ctab
PHON WEAR
0 2214 169
1 13174 2995
A simple \chi^2 test shows these users are significantly different in how they m-bank:
# Test independence of m-banking adoption and user type (PHON vs WEAR).
chisq.test(ctab)
Pearson's Chi-squared test with Yates' continuity correction
data: ctab
X-squared = 191.04, df = 1, p-value < 2.2e-16
Summary Statistics of the dataset:
# Summary statistics (psych::describe) for the numeric model variables,
# rendered as a gt table.
psycDescribe <- psych::describe(
  wrk_ds_fulldata %>%
    dplyr::select(isSmartWear, AGE, SEX, EDU, INCOME, EFF_TIME, TRST_BANK, PSEC) # numeric columns only
)
psycDescribe <- as.data.frame(psycDescribe)
gt(psycDescribe)
vars | n | mean | sd | median | trimmed | mad | min | max | range | skew | kurtosis | se |
---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 18552 | 1.705476e-01 | 0.3761234 | 0.0000000 | 0.08819566 | 0.000000 | 0.000000 | 1.000000 | 1.000000 | 1.75173699 | 1.0686401 | 0.002761436 |
2 | 18552 | 4.117400e+00 | 1.5025952 | 4.0000000 | 4.20165746 | 1.482600 | 1.000000 | 6.000000 | 5.000000 | -0.31941695 | -1.0076646 | 0.011031806 |
3 | 18552 | 5.211837e-01 | 0.4995645 | 1.0000000 | 0.52647891 | 0.000000 | 0.000000 | 1.000000 | 1.000000 | -0.08480409 | -1.9929157 | 0.003667720 |
4 | 18552 | 2.151951e+00 | 0.7866826 | 2.0000000 | 2.18993397 | 1.482600 | 1.000000 | 3.000000 | 2.000000 | -0.27452999 | -1.3370776 | 0.005775694 |
5 | 18552 | 3.230972e+00 | 1.3482437 | 3.0000000 | 3.28870772 | 1.482600 | 1.000000 | 5.000000 | 4.000000 | -0.19217593 | -1.1623489 | 0.009898583 |
6 | 18552 | 5.085705e-01 | 0.4999400 | 1.0000000 | 0.51071284 | 0.000000 | 0.000000 | 1.000000 | 1.000000 | -0.03428428 | -1.9989323 | 0.003670477 |
7 | 18552 | 3.710921e+00 | 0.9637567 | 4.0000000 | 3.80083547 | 1.482600 | 1.000000 | 5.000000 | 4.000000 | -0.68270585 | 0.3415415 | 0.007075743 |
8 | 18552 | 2.573285e-17 | 0.8816774 | 0.1028678 | 0.03752289 | 1.053947 | -1.529966 | 1.302335 | 2.832302 | -0.30718042 | -1.0974666 | 0.006473130 |
Modeling
First, since there’s clustering on provinces, I run a random-intercept (mixed-effects) model and test it against a simple logistic regression to see if there is a grouping effect. First of all, the model fails to converge -
# Mixed-effects logit with a random intercept per province (1 | PRVNC).
# NOTE(review): the name says "fixedeffect" but (1 | PRVNC) is a random
# intercept; kept for compatibility with the lrtest() call below.
# The non-integer scaled weights trigger the binomial warning seen in output.
model_full_fixedeffect <- glmer(
  MBANK ~ AGE_f + SEX_f + EDU_f + INCOME_f + EFF_TIME_f + TRST_BANK_f + PSEC + (1 | PRVNC),
  data = wrk_ds_fulldata,
  family = binomial,
  weights = scaled_wtpg
)
Warning :non-integer #successes in a binomial glm!
Warning :failure to converge in 10000 evaluations
Warning :convergence code 4 from Nelder_Mead: failure to converge in 10000 evaluations
Warning :unable to evaluate scaled gradient
Warning :Model failed to converge: degenerate Hessian with 1 negative eigenvalues
# Plain weighted logit on the full sample; quasibinomial accommodates the
# non-integer survey weights (at the cost of a true likelihood).
model_full <- glm(
  MBANK ~ AGE_f + SEX_f + EDU_f + INCOME_f + EFF_TIME_f + TRST_BANK_f + PSEC,
  data = wrk_ds_fulldata,
  family = quasibinomial,
  weights = scaled_wtpg
)
The LR test shows that there is no improvement to the model, so going with the simpler model is better.
# Likelihood-ratio test: pooled GLM vs. the province random-intercept model.
# NOTE(review): model_full is quasibinomial (no true likelihood) and the
# glmer failed to converge, so treat this comparison as indicative only.
lrtest(model_full, model_full_fixedeffect)
#DF | LogLik | Df | Chisq | Pr(>Chisq) |
---|---|---|---|---|
19 | ||||
20 | -27.041 | 1 |
The mathematical formulations of all three models follows this:
\begin{equation*} \begin{split} & logit(MBANK) = \\ & \hspace{1cm} \beta_0 + \beta_1 \ AGE_1 \ + \beta_2 \ AGE_3 \ + \beta_3 \ AGE_4 \ + \beta_4 \ AGE_5 \ + \beta_5 \ AGE_6 \ + \\ & \hspace{1cm} \beta_6 \ SEX_F \ + \beta_7 \ EDU_2 \ + \beta_8 \ EDU_3 \ + \beta_{9} \ INCOME_2 \ + \beta_{10} \ INCOME_3 \ + \\ & \hspace{1cm} \beta_{11} \ INCOME_4 \ + \beta_{12} \ INCOME_5 \ + \beta_{13} \ EFF\_TIME \ + \beta_{14} \ TRST\_BANK_1 \ + \\ & \hspace{1cm} \beta_{15} \ TRST\_BANK_2 \ + \beta_{16} \ TRST\_BANK_3 \ + \beta_{17} \ TRST\_BANK_4 \ + \beta_{18} \ PSEC \ + \epsilon \\ \end{split} \end{equation*}
The only difference is the dataset each is coming from. I do need to calculate the robust cluster standard errors for more accurate results:
# Cluster-robust (by province) covariance for the full-data model.
model_full_cluster_se <- vcovCL(model_full, cluster = wrk_ds_fulldata$PRVNC)

# Summary table of results with the clustered standard errors.
model_full_summary_clustered <- coeftest(model_full, vcov = model_full_cluster_se)

# PHON-only model: same specification, phone-only subsample.
model_phon <- glm(
  MBANK ~ AGE_f + SEX_f + EDU_f + INCOME_f + EFF_TIME_f + TRST_BANK_f + PSEC,
  data = wrk_ds_fulldata_PHON,
  family = quasibinomial,
  weights = scaled_wtpg
)
model_phon_cluster_se <- vcovCL(model_phon, cluster = wrk_ds_fulldata_PHON$PRVNC)
model_phon_summary_clustered <- coeftest(model_phon, vcov = model_phon_cluster_se)

# WEAR-only model: same specification, wearable-user subsample.
model_wear <- glm(
  MBANK ~ AGE_f + SEX_f + EDU_f + INCOME_f + EFF_TIME_f + TRST_BANK_f + PSEC,
  data = wrk_ds_fulldata_WEAR,
  family = quasibinomial,
  weights = scaled_wtpg
)
model_wear_cluster_se <- vcovCL(model_wear, cluster = wrk_ds_fulldata_WEAR$PRVNC)
model_wear_summary_clustered <- coeftest(model_wear, vcov = model_wear_cluster_se)
Using texreg (in my original R markdown file, and htmlreg here for website preview) to produce \LaTeX code for the table:
# Console preview of the three clustered-SE coefficient tables side by side;
# screenreg() accepts coeftest objects via texreg's extract methods.
screenreg(
list(model_full_summary_clustered, model_phon_summary_clustered, model_wear_summary_clustered),
custom.model.names = c("Full Data (coeff)",
"Phone Only (coeff)",
"WEAR Only (coeff)")
)
======================================================================
Full Data (coeff) Phone Only (coeff) WEAR Only (coeff)
----------------------------------------------------------------------
(Intercept) 1.83 *** 1.95 *** 1.23 **
(0.16) (0.20) (0.39)
AGE_f1 -1.01 *** -1.12 *** -0.25
(0.05) (0.11) (0.25)
AGE_f3 0.02 -0.06 0.52 **
(0.10) (0.12) (0.16)
AGE_f4 -0.03 -0.12 1.00
(0.13) (0.18) (0.52)
AGE_f5 -0.09 -0.13 0.16
(0.17) (0.20) (0.30)
AGE_f6 -0.23 -0.29 0.39
(0.13) (0.15) (0.35)
SEX_f1 0.11 ** 0.09 *** 0.16
(0.03) (0.02) (0.27)
EDU_f2 0.32 *** 0.25 *** 1.02 ***
(0.07) (0.05) (0.29)
EDU_f3 0.47 *** 0.42 *** 0.94 ***
(0.02) (0.03) (0.22)
INCOME_f2 0.15 * 0.20 * -0.51
(0.07) (0.09) (0.35)
INCOME_f3 0.20 * 0.21 * -0.03
(0.09) (0.10) (0.28)
INCOME_f4 0.43 *** 0.42 *** 0.19
(0.12) (0.12) (0.42)
INCOME_f5 0.59 *** 0.60 *** 0.16
(0.10) (0.15) (0.44)
EFF_TIME_f1 0.56 *** 0.54 *** 0.69 *
(0.05) (0.04) (0.33)
TRST_BANK_f1 -1.44 *** -1.53 *** -0.00
(0.18) (0.17) (0.59)
TRST_BANK_f2 -0.69 *** -0.67 *** -0.66 ***
(0.17) (0.18) (0.19)
TRST_BANK_f3 -0.67 *** -0.73 *** -0.29
(0.13) (0.12) (0.19)
TRST_BANK_f4 -0.11 -0.16 0.31
(0.12) (0.09) (0.40)
PSEC 0.93 *** 0.94 *** 0.70 ***
(0.05) (0.04) (0.09)
======================================================================
*** p < 0.001; ** p < 0.01; * p < 0.05
The R code to generate \LaTeX code:
# Emit the LaTeX version of the same three-model table (texreg, booktabs).
texreg(
list(model_full_summary_clustered,
model_phon_summary_clustered,
model_wear_summary_clustered),custom.model.names = c("Full Data", "Phone-Only", "Wear-Only"),
digits = 3,
stars = c(0.001, 0.01, 0.05), # Significance levels for stars
single.row = FALSE, # Standard errors in parentheses below coefficients
custom.note = "Significance levels: *p < 0.05; **p < 0.01; ***p < 0.001",
booktabs = TRUE, # Use booktabs-style formatting
caption = "Logistic Regression Results with Robust Standard Errors"
)
Analysis of Results
Since the point is to compare the values of the coefficients in each model, I will need to perform a Wald test. It’s essentially just a Z statistic calculation. Let’s consider \hat{\beta_{p1}} the estimated coefficient of variable 1 from the PHON-only model and \hat{\beta_{w1}} the same variable’s estimated coefficient in the WEAR-only model. I want to know: is the difference between the two significantly different from zero? That is, H_0: \hat{\beta_{p1}} - \hat{\beta_{w1}} = 0. If I reject H_0, that means they’re significantly different. The statistic is calculated as the ratio of the estimate (difference) over the standard error of the estimate (difference): Z = \frac{\hat{\beta_{p1}} - \hat{\beta_{w1}}}{\sqrt{\sigma^2_{\beta_{p1}} + \sigma^2_{\beta_{w1}}}}
# Wald z-test for the difference between the same coefficient in two models.
#
# model1, model2: coeftest-style matrices (rows = coefficient names,
#   column 1 = estimate, column 2 = standard error).
# variable_name: the coefficient's row name, e.g. "PSEC".
# Returns a named numeric vector c(z = ..., p = ...) with the z statistic
# and its two-sided normal p-value.
calc_wald_test <- function(model1, model2, variable_name) {
  coeff1 <- model1[variable_name, 1]
  coeff2 <- model2[variable_name, 1]

  se1 <- model1[variable_name, 2]
  # BUG FIX: the original read se2 from model1, so the second model's
  # standard error was never used and every z statistic was deflated.
  se2 <- model2[variable_name, 2]

  diff <- coeff1 - coeff2
  se_diff <- sqrt(se1^2 + se2^2)   # SE of a difference of independent estimates
  z <- diff / se_diff
  p <- 2 * (1 - pnorm(abs(z)))     # two-sided p-value under N(0, 1)
  res <- c(z = z, p = p)

  return(res)
}
Here’s how you do this:
- Get the results for FullData vs PHON
- Get the results for FullData vs WEAR
- Get the results for PHON vs WEAR
# Coefficient names shared by all three clustered-SE model summaries
# (reference levels AGE 2, TRST_BANK 5 are absent by construction).
coefs_ <- c('AGE_f1', 'AGE_f3', 'AGE_f4', 'AGE_f5', 'AGE_f6',
            'SEX_f1',
            'EDU_f2', 'EDU_f3',
            'INCOME_f2', 'INCOME_f3', 'INCOME_f4', 'INCOME_f5',
            'EFF_TIME_f1',
            'TRST_BANK_f1', 'TRST_BANK_f2', 'TRST_BANK_f3', 'TRST_BANK_f4',
            'PSEC')

# Accumulator for the formatted per-coefficient comparison rows.
results <- c()
# Loop through coefficients and run all three pairwise Wald comparisons.
# The $, ^{}, and & tokens make each row paste-ready for a LaTeX (Overleaf)
# table — they have no meaning in R.
for (coef_ in coefs_) {
  full_vs_phone <- calc_wald_test(model_full_summary_clustered, model_phon_summary_clustered, coef_)
  full_vs_wear <- calc_wald_test(model_full_summary_clustered, model_wear_summary_clustered, coef_)
  phone_vs_wear <- calc_wald_test(model_phon_summary_clustered, model_wear_summary_clustered, coef_)

  # Format "z & p" for each contrast, rounded to 3 decimals.
  full_vs_phone_text <- paste0("$ ", round(full_vs_phone["z"], 3), "^{} $ & ", " $ ", round(full_vs_phone["p"], 3), "$")
  full_vs_wear_text <- paste0("$ ", round(full_vs_wear["z"], 3), "^{} $ & ", " $ ", round(full_vs_wear["p"], 3), "$")
  phone_vs_wear_text <- paste0("$ ", round(phone_vs_wear["z"], 3), "^{} $ & ", " $ ", round(phone_vs_wear["p"], 3), "$")

  # One table row: the three contrasts joined with LaTeX column separators.
  result <- paste0(
    full_vs_phone_text, " & ",
    full_vs_wear_text, " & ",
    phone_vs_wear_text
  )

  # Append to results list
  results <- c(results, result)
}

# Print the results
for (res in results) {
  print(res)
}
[1] "$ 1.485^{} $ & $ 0.137$ & $ -10.352^{} $ & $ 0$ & $ -5.625^{} $ & $ 0$"
[1] "$ 0.555^{} $ & $ 0.579$ & $ -3.413^{} $ & $ 0.001$ & $ -3.48^{} $ & $ 0.001$"
[1] "$ 0.46^{} $ & $ 0.646$ & $ -5.48^{} $ & $ 0$ & $ -4.488^{} $ & $ 0$"
[1] "$ 0.142^{} $ & $ 0.887$ & $ -1.072^{} $ & $ 0.284$ & $ -1.038^{} $ & $ 0.299$"
[1] "$ 0.33^{} $ & $ 0.742$ & $ -3.488^{} $ & $ 0$ & $ -3.159^{} $ & $ 0.002$"
[1] "$ 0.493^{} $ & $ 0.622$ & $ -1.138^{} $ & $ 0.255$ & $ -2.363^{} $ & $ 0.018$"
[1] "$ 0.698^{} $ & $ 0.485$ & $ -6.641^{} $ & $ 0$ & $ -10.406^{} $ & $ 0$"
[1] "$ 1.5^{} $ & $ 0.134$ & $ -13.742^{} $ & $ 0$ & $ -12.788^{} $ & $ 0$"
[1] "$ -0.525^{} $ & $ 0.6$ & $ 6.484^{} $ & $ 0$ & $ 5.403^{} $ & $ 0$"
[1] "$ -0.069^{} $ & $ 0.945$ & $ 1.932^{} $ & $ 0.053$ & $ 1.658^{} $ & $ 0.097$"
[1] "$ 0.071^{} $ & $ 0.943$ & $ 1.456^{} $ & $ 0.145$ & $ 1.351^{} $ & $ 0.177$"
[1] "$ -0.063^{} $ & $ 0.95$ & $ 3.013^{} $ & $ 0.003$ & $ 2.13^{} $ & $ 0.033$"
[1] "$ 0.382^{} $ & $ 0.702$ & $ -1.923^{} $ & $ 0.054$ & $ -2.489^{} $ & $ 0.013$"
[1] "$ 0.361^{} $ & $ 0.718$ & $ -5.769^{} $ & $ 0$ & $ -6.453^{} $ & $ 0$"
[1] "$ -0.071^{} $ & $ 0.943$ & $ -0.141^{} $ & $ 0.888$ & $ -0.065^{} $ & $ 0.948$"
[1] "$ 0.287^{} $ & $ 0.774$ & $ -2.181^{} $ & $ 0.029$ & $ -2.682^{} $ & $ 0.007$"
[1] "$ 0.301^{} $ & $ 0.763$ & $ -2.475^{} $ & $ 0.013$ & $ -3.559^{} $ & $ 0$"
[1] "$ -0.19^{} $ & $ 0.849$ & $ 3.546^{} $ & $ 0$ & $ 4.524^{} $ & $ 0$"
Bonus Visualizations
# Helper for the seven "proportion of MBANK by <variable>" panels, which all
# share the same line+point layout split by user type (PHON vs WEAR).
#   data       : source data frame (pre-mutated if extra columns are needed)
#   group_var  : bare column used in group_by() to compute the proportion
#   x_expr     : bare expression for the x axis (may relevel group_var)
#   x_lab      : x-axis label
#   point_size, point_alpha, angle_x: per-panel cosmetic tweaks
plot_prop_mbank <- function(data, group_var, x_expr, x_lab,
                            point_size = 3, point_alpha = 1, angle_x = FALSE) {
  x_text <- if (angle_x) {
    element_text(face = "bold", angle = 45, hjust = 1)
  } else {
    element_text(face = "bold")
  }
  ggplot(data %>%
           group_by({{ group_var }}, USR_TYP) %>%
           mutate(Proportion_mbank = mean(MBANK)),
         aes(x = {{ x_expr }}, y = Proportion_mbank,
             color = USR_TYP, group = USR_TYP, linetype = USR_TYP)) +
    geom_line(size = 1) +
    geom_point(size = point_size, alpha = point_alpha) +
    labs(x = x_lab, y = "Proportion MBANK", color = "User Type") +
    theme_minimal() +
    theme(strip.text = element_text(size = 12, face = "bold"),
          axis.text.x = x_text)
}

age_gg <- plot_prop_mbank(wrk_ds_fulldata, AGE_f,
                          relevel(AGE_f, ref = "1"), "AGE Group")
sex_gg <- plot_prop_mbank(wrk_ds_fulldata, SEX_f,
                          relevel(SEX_f, ref = "1"), "Gender")
edu_gg <- plot_prop_mbank(wrk_ds_fulldata, EDU_f, EDU_f, "Education")
trst_gg <- plot_prop_mbank(wrk_ds_fulldata, TRST_BANK_f,
                           relevel(TRST_BANK_f, ref = "1"), "Trust in Bank")
# PSEC is continuous, so bin it into 7 intervals first.
sec_gg <- plot_prop_mbank(wrk_ds_fulldata %>% mutate(PSECBIN = cut(PSEC, breaks = 7)),
                          PSECBIN, PSECBIN, "PSEC (Binned)",
                          point_size = 2, point_alpha = 0.6, angle_x = TRUE)
income_gg <- plot_prop_mbank(wrk_ds_fulldata, INCOME_f, INCOME_f, "Income")
eff_gg <- plot_prop_mbank(wrk_ds_fulldata, EFF_TIME_f, EFF_TIME_f, "EFF_TIME")

ggarrange(age_gg, sex_gg, edu_gg, income_gg, eff_gg, trst_gg, sec_gg,
          ncol = 4, nrow = 2)