From 7a16526a97ae80b10509bcf9210bb2b4183dd128 Mon Sep 17 00:00:00 2001 From: Michael Beck Date: Tue, 15 Aug 2023 14:20:13 +0200 Subject: [PATCH] adds dataset profiles --- data/OUT/profiles/AllTweets.html | 1185 +++++++ data/OUT/profiles/CovTweets.html | 5086 ++++++++++++++++++++++++++++++ 2 files changed, 6271 insertions(+) create mode 100644 data/OUT/profiles/AllTweets.html create mode 100644 data/OUT/profiles/CovTweets.html diff --git a/data/OUT/profiles/AllTweets.html b/data/OUT/profiles/AllTweets.html new file mode 100644 index 0000000..5629324 --- /dev/null +++ b/data/OUT/profiles/AllTweets.html @@ -0,0 +1,1185 @@ +Pandas Profiling Report

Overview

Dataset statistics

Number of variables35
Number of observations423967
Missing cells5300783
Missing cells (%)35.7%
Total size in memory113.2 MiB
Average record size in memory280.0 B

Variable types

Categorical31
Unsupported4

Alerts

user.verified has constant value "False"Constant
Unnamed: 0 has a high cardinality: 820 distinct valuesHigh cardinality
id has a high cardinality: 423612 distinct valuesHigh cardinality
user.id has a high cardinality: 177 distinct valuesHigh cardinality
user.username has a high cardinality: 177 distinct valuesHigh cardinality
user.created has a high cardinality: 177 distinct valuesHigh cardinality
user.favouritesCount has a high cardinality: 162 distinct valuesHigh cardinality
user.followersCount has a high cardinality: 193 distinct valuesHigh cardinality
user.friendsCount has a high cardinality: 170 distinct valuesHigh cardinality
user.url has a high cardinality: 177 distinct valuesHigh cardinality
rawContent has a high cardinality: 420937 distinct valuesHigh cardinality
renderedContent has a high cardinality: 420368 distinct valuesHigh cardinality
coordinates has a high cardinality: 205 distinct valuesHigh cardinality
hashtags has a high cardinality: 20247 distinct valuesHigh cardinality
inReplyToTweetId has a high cardinality: 55386 distinct valuesHigh cardinality
inReplyToUser has a high cardinality: 2942 distinct valuesHigh cardinality
media has a high cardinality: 108424 distinct valuesHigh cardinality
mentionedUsers has a high cardinality: 41533 distinct valuesHigh cardinality
links has a high cardinality: 135021 distinct valuesHigh cardinality
place has a high cardinality: 199 distinct valuesHigh cardinality
quotedTweet has a high cardinality: 57466 distinct valuesHigh cardinality
url has a high cardinality: 423612 distinct valuesHigh cardinality
date has a high cardinality: 398624 distinct valuesHigh cardinality
replyCount has a high cardinality: 6433 distinct valuesHigh cardinality
retweetCount has a high cardinality: 8781 distinct valuesHigh cardinality
likeCount has a high cardinality: 22039 distinct valuesHigh cardinality
quoteCount has a high cardinality: 2673 distinct valuesHigh cardinality
conversationId has a high cardinality: 370501 distinct valuesHigh cardinality
contains_keyword has a high cardinality: 1170 distinct valuesHigh cardinality
quoteCount is highly imbalanced (56.9%)Imbalance
lang is highly imbalanced (95.9%)Imbalance
contains_keyword is highly imbalanced (89.9%)Imbalance
cashtags has 423962 (> 99.9%) missing valuesMissing
coordinates has 423163 (99.8%) missing valuesMissing
hashtags has 335306 (79.1%) missing valuesMissing
inReplyToTweetId has 368336 (86.9%) missing valuesMissing
inReplyToUser has 368336 (86.9%) missing valuesMissing
media has 312688 (73.8%) missing valuesMissing
mentionedUsers has 302624 (71.4%) missing valuesMissing
links has 288127 (68.0%) missing valuesMissing
place has 423163 (99.8%) missing valuesMissing
quotedTweet has 359210 (84.7%) missing valuesMissing
retweetedTweet has 423967 (100.0%) missing valuesMissing
sourceLabel has 423967 (100.0%) missing valuesMissing
sourceUrl has 423967 (100.0%) missing valuesMissing
source has 423967 (100.0%) missing valuesMissing
retweetedTweet is an unsupported type, check if it needs cleaning or further analysisUnsupported
sourceLabel is an unsupported type, check if it needs cleaning or further analysisUnsupported
sourceUrl is an unsupported type, check if it needs cleaning or further analysisUnsupported
source is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-08-08 12:58:29.913833
Analysis finished2023-08-08 12:58:39.254746
Duration9.34 seconds
Software versionpandas-profiling v3.6.6
Download configurationconfig.json

Variables

Unnamed: 0
Categorical

Distinct820
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
0
 
3757
1
 
3674
2
 
3623
3
 
3586
4
 
3544
Other values (815)
405783 

Unique

Unique100 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row1
3rd row2
4th row0
5th row1

Common Values

ValueCountFrequency (%)
0 3757
 
0.9%
1 3674
 
0.9%
2 3623
 
0.9%
3 3586
 
0.8%
4 3544
 
0.8%
5 3513
 
0.8%
6 3481
 
0.8%
7 3454
 
0.8%
8 3433
 
0.8%
9 3408
 
0.8%
Other values (810) 388494
91.6%

id
Categorical

Distinct423612
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
1295483839565856768
 
2
1234868300884000771
 
2
1232779505279938561
 
2
1233091991015055362
 
2
1233106983936516096
 
2
Other values (423607)
423957 

Unique

Unique423257 ?
Unique (%)99.8%

Sample

1st row1226988645044891649
2nd row1217185789156777986
3rd row1216891845692837888
4th row1394083996653563907
5th row1393972598237696007

Common Values

ValueCountFrequency (%)
1295483839565856768 2
 
< 0.1%
1234868300884000771 2
 
< 0.1%
1232779505279938561 2
 
< 0.1%
1233091991015055362 2
 
< 0.1%
1233106983936516096 2
 
< 0.1%
1233172655060156416 2
 
< 0.1%
1233492502910570496 2
 
< 0.1%
1233502239416815619 2
 
< 0.1%
1233903840358887424 2
 
< 0.1%
1234546239166799872 2
 
< 0.1%
Other values (423602) 423947
> 99.9%

user.id
Categorical

Distinct177
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
13218102
 
9438
23022687
 
8806
278145569
 
7527
1074480192
 
6770
131546062
 
6678
Other values (172)
384748 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row87510313
2nd row87510313
3rd row87510313
4th row216776631
5th row216776631

Common Values

ValueCountFrequency (%)
13218102 9438
 
2.2%
23022687 8806
 
2.1%
278145569 7527
 
1.8%
1074480192 6770
 
1.6%
131546062 6678
 
1.6%
242555999 6536
 
1.5%
18915145 6201
 
1.5%
109287731 6171
 
1.5%
150078976 5828
 
1.4%
247334603 5760
 
1.4%
Other values (167) 354252
83.6%

user.username
Categorical

Distinct177
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
JohnCornyn
 
9438
tedcruz
 
8806
MarshaBlackburn
 
7527
SenTedCruz
 
6770
SenRickScott
 
6678
Other values (172)
384748 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowtammybaldwin
2nd rowtammybaldwin
3rd rowtammybaldwin
4th rowBernieSanders
5th rowBernieSanders

Common Values

ValueCountFrequency (%)
JohnCornyn 9438
 
2.2%
tedcruz 8806
 
2.1%
MarshaBlackburn 7527
 
1.8%
SenTedCruz 6770
 
1.6%
SenRickScott 6678
 
1.6%
SenWhitehouse 6536
 
1.5%
senrobportman 6201
 
1.5%
SenatorShaheen 6171
 
1.5%
ChrisMurphyCT 5828
 
1.4%
SenatorDurbin 5760
 
1.4%
Other values (167) 354252
83.6%

user.verified
Categorical

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
False
423967 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFalse
2nd rowFalse
3rd rowFalse
4th rowFalse
5th rowFalse

Common Values

ValueCountFrequency (%)
False 423967
100.0%

Common Values (Plot)

2023-08-08T14:58:39.339601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

user.created
Categorical

Distinct177
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
2008-02-07 19:52:55+00:00
 
9438
2009-03-06 03:20:20+00:00
 
8806
2011-04-06 18:05:33+00:00
 
7527
2013-01-09 18:11:37+00:00
 
6770
2010-04-10 16:03:04+00:00
 
6678
Other values (172)
384748 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row2009-11-04 19:20:25+00:00
2nd row2009-11-04 19:20:25+00:00
3rd row2009-11-04 19:20:25+00:00
4th row2010-11-17 17:53:52+00:00
5th row2010-11-17 17:53:52+00:00

Common Values

ValueCountFrequency (%)
2008-02-07 19:52:55+00:00 9438
 
2.2%
2009-03-06 03:20:20+00:00 8806
 
2.1%
2011-04-06 18:05:33+00:00 7527
 
1.8%
2013-01-09 18:11:37+00:00 6770
 
1.6%
2010-04-10 16:03:04+00:00 6678
 
1.6%
2011-01-25 01:52:03+00:00 6536
 
1.5%
2009-01-12 20:56:42+00:00 6201
 
1.5%
2010-01-28 15:22:44+00:00 6171
 
1.5%
2010-05-31 01:22:43+00:00 5828
 
1.4%
2011-02-04 15:50:42+00:00 5760
 
1.4%
Other values (167) 354252
83.6%
Distinct162
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
510
 
10509
10678
 
9438
1294
 
8806
1568
 
8663
2138
 
7527
Other values (157)
379024 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row289
2nd row289
3rd row289
4th row939
5th row939

Common Values

ValueCountFrequency (%)
510 10509
 
2.5%
10678 9438
 
2.2%
1294 8806
 
2.1%
1568 8663
 
2.0%
2138 7527
 
1.8%
523 6770
 
1.6%
256 6678
 
1.6%
1098 6536
 
1.5%
467 6201
 
1.5%
139 6047
 
1.4%
Other values (152) 346792
81.8%
Distinct193
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
369869
 
9438
1019563
 
7209
3404475
 
6770
445543
 
6678
603853
 
6536
Other values (188)
387336 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row53853
2nd row53853
3rd row53853
4th row15408026
5th row15408026

Common Values

ValueCountFrequency (%)
369869 9438
 
2.2%
1019563 7209
 
1.7%
3404475 6770
 
1.6%
445543 6678
 
1.6%
603853 6536
 
1.5%
168873 6201
 
1.5%
129843 6171
 
1.5%
1095774 5828
 
1.4%
739969 5760
 
1.4%
115510 5696
 
1.3%
Other values (183) 357680
84.4%
Distinct170
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
13109
 
9438
5418
 
8806
2022
 
7527
5481
 
6770
1283
 
6678
Other values (165)
384748 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row4072
2nd row4072
3rd row4072
4th row1457
5th row1457

Common Values

ValueCountFrequency (%)
13109 9438
 
2.2%
5418 8806
 
2.1%
2022 7527
 
1.8%
5481 6770
 
1.6%
1283 6678
 
1.6%
2028 6536
 
1.5%
4994 6201
 
1.5%
2822 6171
 
1.5%
271 5828
 
1.4%
2418 5760
 
1.4%
Other values (160) 354252
83.6%

user.url
Categorical

Distinct177
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
https://twitter.com/JohnCornyn
 
9438
https://twitter.com/tedcruz
 
8806
https://twitter.com/MarshaBlackburn
 
7527
https://twitter.com/SenTedCruz
 
6770
https://twitter.com/SenRickScott
 
6678
Other values (172)
384748 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowhttps://twitter.com/tammybaldwin
2nd rowhttps://twitter.com/tammybaldwin
3rd rowhttps://twitter.com/tammybaldwin
4th rowhttps://twitter.com/BernieSanders
5th rowhttps://twitter.com/BernieSanders

Common Values

ValueCountFrequency (%)
https://twitter.com/JohnCornyn 9438
 
2.2%
https://twitter.com/tedcruz 8806
 
2.1%
https://twitter.com/MarshaBlackburn 7527
 
1.8%
https://twitter.com/SenTedCruz 6770
 
1.6%
https://twitter.com/SenRickScott 6678
 
1.6%
https://twitter.com/SenWhitehouse 6536
 
1.5%
https://twitter.com/senrobportman 6201
 
1.5%
https://twitter.com/SenatorShaheen 6171
 
1.5%
https://twitter.com/ChrisMurphyCT 5828
 
1.4%
https://twitter.com/SenatorDurbin 5760
 
1.4%
Other values (167) 354252
83.6%

rawContent
Categorical

Distinct420937
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
👀
 
180
👇
 
112
🤔
 
106
😬
 
60
There still is a crisis at the border.
 
56
Other values (420932)
423453 

Unique

Unique419522 ?
Unique (%)99.0%

Sample

1st rowWe need more folks like @mjhegar in the Senate. Take a look at her ad below and chip in a few dollars if you can!
2nd rowPresident Trump will be in MKE tonight while the Dem candidates are debating. Today I joined @HallieJackson to discuss the clear choice for voters. Democrats want to protect coverage for preexisting conditions while President Trump wants to take them away. https://t.co/RheooZ2wAS
3rd rowThe American people want a fair and honest trial!
4th rowThe problem is not a labor shortage in this country. The problem is that in state after state, workers are being asked to work for starvation wages and no benefits. https://t.co/y9QbLkjbP9
5th rowThe devastation in Gaza is unconscionable. We must urge an immediate ceasefire. The killing of Palestinians and Israelis must end. We must also take a hard look at nearly $4 billion a year in military aid to Israel. It is illegal for U.S. aid to support human rights violations.

Common Values

ValueCountFrequency (%)
👀 180
 
< 0.1%
👇 112
 
< 0.1%
🤔 106
 
< 0.1%
😬 60
 
< 0.1%
There still is a crisis at the border. 56
 
< 0.1%
👇👇👇 49
 
< 0.1%
Today would be a great day for President Biden and Vice President Harris to #CancelStudentDebt. 40
 
< 0.1%
🔥🔥🔥 35
 
< 0.1%
Yes. 28
 
< 0.1%
#BidenBorderCrisis 22
 
< 0.1%
Other values (420927) 423279
99.8%

renderedContent
Categorical

Distinct420368
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
👀
 
180
👇
 
112
🤔
 
106
markey.senate.gov/news/press-rel…
 
81
😬
 
60
Other values (420363)
423428 

Unique

Unique418757 ?
Unique (%)98.8%

Sample

1st rowWe need more folks like @mjhegar in the Senate. Take a look at her ad below and chip in a few dollars if you can!
2nd rowPresident Trump will be in MKE tonight while the Dem candidates are debating. Today I joined @HallieJackson to discuss the clear choice for voters. Democrats want to protect coverage for preexisting conditions while President Trump wants to take them away. msnbc.com/hallie-jackson…
3rd rowThe American people want a fair and honest trial!
4th rowThe problem is not a labor shortage in this country. The problem is that in state after state, workers are being asked to work for starvation wages and no benefits. https://t.co/y9QbLkjbP9
5th rowThe devastation in Gaza is unconscionable. We must urge an immediate ceasefire. The killing of Palestinians and Israelis must end. We must also take a hard look at nearly $4 billion a year in military aid to Israel. It is illegal for U.S. aid to support human rights violations.

Common Values

ValueCountFrequency (%)
👀 180
 
< 0.1%
👇 112
 
< 0.1%
🤔 106
 
< 0.1%
markey.senate.gov/news/press-rel… 81
 
< 0.1%
😬 60
 
< 0.1%
There still is a crisis at the border. 56
 
< 0.1%
👇👇👇 49
 
< 0.1%
Speaking live on the Senate floor: twitter.com/i/broadcasts/1… 46
 
< 0.1%
rubio.senate.gov/public/index.c… 40
 
< 0.1%
Today would be a great day for President Biden and Vice President Harris to #CancelStudentDebt. 40
 
< 0.1%
Other values (420358) 423197
99.8%

cashtags
Categorical

Distinct3
Distinct (%)60.0%
Missing423962
Missing (%)> 99.9%
Memory size3.2 MiB
['s']
['GME']
['TSLA']

Unique

Unique2 ?
Unique (%)40.0%

Sample

1st row['GME']
2nd row['s']
3rd row['TSLA']
4th row['s']
5th row['s']

Common Values

ValueCountFrequency (%)
['s'] 3
 
< 0.1%
['GME'] 1
 
< 0.1%
['TSLA'] 1
 
< 0.1%
(Missing) 423962
> 99.9%

Common Values (Plot)

2023-08-08T14:58:39.467685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

coordinates
Categorical

HIGH CARDINALITY  MISSING 

Distinct205
Distinct (%)25.5%
Missing423163
Missing (%)99.8%
Memory size3.2 MiB
Coordinates(longitude=-77.119401, latitude=38.801826)
243 
Coordinates(longitude=-104.048915, latitude=45.935021)
 
32
Coordinates(longitude=-100.839587, latitude=46.756481)
 
30
Coordinates(longitude=-81.7071748, latitude=38.293241)
 
22
Coordinates(longitude=-75.7887564, latitude=38.4510398)
 
19
Other values (200)
458 

Unique

Unique116 ?
Unique (%)14.4%

Sample

1st rowCoordinates(longitude=-92.889433, latitude=42.491921)
2nd rowCoordinates(longitude=-92.889433, latitude=42.491921)
3rd rowCoordinates(longitude=-91.7745785, latitude=41.886245)
4th rowCoordinates(longitude=-92.600005, latitude=41.978479)
5th rowCoordinates(longitude=-92.821734, latitude=41.004573)

Common Values

ValueCountFrequency (%)
Coordinates(longitude=-77.119401, latitude=38.801826) 243
 
0.1%
Coordinates(longitude=-104.048915, latitude=45.935021) 32
 
< 0.1%
Coordinates(longitude=-100.839587, latitude=46.756481) 30
 
< 0.1%
Coordinates(longitude=-81.7071748, latitude=38.293241) 22
 
< 0.1%
Coordinates(longitude=-75.7887564, latitude=38.4510398) 19
 
< 0.1%
Coordinates(longitude=-102.051769, latitude=36.9931101) 16
 
< 0.1%
Coordinates(longitude=-93.709504, latitude=41.501409) 16
 
< 0.1%
Coordinates(longitude=-82.644739, latitude=37.201483) 12
 
< 0.1%
Coordinates(longitude=-77.172219, latitude=38.827378) 11
 
< 0.1%
Coordinates(longitude=-86.550888, latitude=36.3161629) 11
 
< 0.1%
Other values (195) 392
 
0.1%
(Missing) 423163
99.8%

hashtags
Categorical

HIGH CARDINALITY  MISSING 

Distinct20247
Distinct (%)22.8%
Missing335306
Missing (%)79.1%
Memory size3.2 MiB
['COVID19']
 
6023
['coronavirus']
 
1349
['gapol', 'gasen']
 
1180
['BuildBackBetter']
 
914
['BidenBorderCrisis']
 
912
Other values (20242)
78283 

Unique

Unique14228 ?
Unique (%)16.0%

Sample

1st row['BernieInKY']
2nd row['BernieInKY']
3rd row['BernieInKY']
4th row['BernieInKY']
5th row['BernieInKY']

Common Values

ValueCountFrequency (%)
['COVID19'] 6023
 
1.4%
['coronavirus'] 1349
 
0.3%
['gapol', 'gasen'] 1180
 
0.3%
['BuildBackBetter'] 914
 
0.2%
['BidenBorderCrisis'] 912
 
0.2%
['CancelStudentDebt'] 733
 
0.2%
['BlackHistoryMonth'] 692
 
0.2%
['AmericanRescuePlan'] 676
 
0.2%
['SCOTUS'] 606
 
0.1%
['SOTU'] 577
 
0.1%
Other values (20237) 74999
 
17.7%
(Missing) 335306
79.1%

inReplyToTweetId
Categorical

HIGH CARDINALITY  MISSING 

Distinct55386
Distinct (%)99.6%
Missing368336
Missing (%)86.9%
Memory size3.2 MiB
1.2328318294360556e+18
 
11
1.5949093086844928e+18
 
7
1.504212607590355e+18
 
5
1.263626990227198e+18
 
5
1.5117488363095818e+18
 
5
Other values (55381)
55598 

Unique

Unique55190 ?
Unique (%)99.2%

Sample

1st row1.3888591429500355e+18
2nd row1.3888591417084518e+18
3rd row1.3888591406598963e+18
4th row1.3888591397204664e+18
5th row1.3888591387473756e+18

Common Values

ValueCountFrequency (%)
1.2328318294360556e+18 11
 
< 0.1%
1.5949093086844928e+18 7
 
< 0.1%
1.504212607590355e+18 5
 
< 0.1%
1.263626990227198e+18 5
 
< 0.1%
1.5117488363095818e+18 5
 
< 0.1%
1.4280314916662313e+18 4
 
< 0.1%
1.3372490489533112e+18 4
 
< 0.1%
1.5962462076090696e+18 4
 
< 0.1%
1.6061351559023862e+18 3
 
< 0.1%
1.2725637151963505e+18 3
 
< 0.1%
Other values (55376) 55580
 
13.1%
(Missing) 368336
86.9%

inReplyToUser
Categorical

HIGH CARDINALITY  MISSING 

Distinct2942
Distinct (%)5.3%
Missing368336
Missing (%)86.9%
Memory size3.2 MiB
https://twitter.com/senrobportman
 
1731
https://twitter.com/lisamurkowski
 
1708
https://twitter.com/LindseyGrahamSC
 
1434
https://twitter.com/SenBlumenthal
 
1405
https://twitter.com/SenWhitehouse
 
1397
Other values (2937)
47956 

Unique

Unique2341 ?
Unique (%)4.2%

Sample

1st rowhttps://twitter.com/BernieSanders
2nd rowhttps://twitter.com/BernieSanders
3rd rowhttps://twitter.com/BernieSanders
4th rowhttps://twitter.com/BernieSanders
5th rowhttps://twitter.com/BernieSanders

Common Values

ValueCountFrequency (%)
https://twitter.com/senrobportman 1731
 
0.4%
https://twitter.com/lisamurkowski 1708
 
0.4%
https://twitter.com/LindseyGrahamSC 1434
 
0.3%
https://twitter.com/SenBlumenthal 1405
 
0.3%
https://twitter.com/SenWhitehouse 1397
 
0.3%
https://twitter.com/SenatorMenendez 1347
 
0.3%
https://twitter.com/SenBobCasey 1342
 
0.3%
https://twitter.com/ChrisCoons 1241
 
0.3%
https://twitter.com/JohnCornyn 1178
 
0.3%
https://twitter.com/SenRickScott 1019
 
0.2%
Other values (2932) 41829
 
9.9%
(Missing) 368336
86.9%

media
Categorical

HIGH CARDINALITY  MISSING 

Distinct108424
Distinct (%)97.4%
Missing312688
Missing (%)73.8%
Memory size3.2 MiB
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1225775550448439296/img/OorzQZ8gL_TgPHFf.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/vid/640x360/pt6ZsDytiLWUMgQ5.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/vid/480x270/XGch5_MXP5VpOoT9.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/pl/IdL9UwWWcazfrbmB.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/vid/1280x720/sxBjx6xvOH2xKrGj.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=3.462, views=43997, altText=None)]
 
47
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1302307323470319616/img/AqtjJsB1Rnndj79X.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/vid/480x270/TVSedxlTI706vPn_.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/pl/5zzHU49sHl9W5Zcg.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/vid/1280x720/nXn6oFl144kuH7p5.mp4?tag=13', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/vid/640x360/prGY9PcLBpjth_mR.mp4?tag=13', contentType='video/mp4', bitrate=832000)], duration=31.732, views=7583, altText=None)]
 
11
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1308723265947545602/img/fteOKUXbfnZvDxRM.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/vid/480x270/nadZGl6dQxqIiB8l.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/vid/640x360/trfnQBPDh4qTJSsE.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/pl/YGio6mjrClPvzMSu.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/vid/1280x720/bEfK6coyBzlidmU7.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=30.155, views=18189, altText=None)]
 
11
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1293900872213291008/img/pX-vF3riIJsoeccY.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/vid/480x270/aYO1ydehwGPE24nz.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/pl/701H0265y9yCgEdl.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/vid/640x360/oBIlDzjqv9ENrv9s.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/vid/1280x720/JyjhwZsi8ThCdIaC.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=30.03, views=14408, altText=None)]
 
11
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1255503615038545925/pu/img/TuxOaa5LgRhW-ISS.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/vid/1280x720/3VSQXmozy2Y9bo-q.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/vid/480x270/lEzMrdAqX0YCZRBD.mp4?tag=10', contentType='video/mp4', bitrate=256000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/vid/640x360/J0F_sb2gRFwNxeub.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/pl/QlXgy81fRppcPRgl.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None)], duration=55.722, views=28851, altText=None)]
 
11
Other values (108419)
111188 

Unique

Unique106321 ?
Unique (%)95.5%

Sample

1st row[Video(thumbnailUrl='https://pbs.twimg.com/media/E1jHWm7WUAMdBoN.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1394082853424611332/vid/320x320/OQwvo-GqY4Ixx1Gn.mp4?tag=14', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1394082853424611332/vid/720x720/GryrvCS9RH4GRvPA.mp4?tag=14', contentType='video/mp4', bitrate=1280000), VideoVariant(url='https://video.twimg.com/amplify_video/1394082853424611332/pl/RGkvFi0FPDApM-gV.m3u8?tag=14', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1394082853424611332/vid/540x540/2pL7jBwgbx-GSles.mp4?tag=14', contentType='video/mp4', bitrate=832000)], duration=90.716, views=158678, altText=None)]
2nd row[Video(thumbnailUrl='https://pbs.twimg.com/media/E1R2EtyX0AoQi9P.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1392867688175833091/vid/320x320/qo6EeW6fPGXG5yNQ.mp4?tag=14', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1392867688175833091/vid/720x720/VPaSE91KtPhS1Hcc.mp4?tag=14', contentType='video/mp4', bitrate=1280000), VideoVariant(url='https://video.twimg.com/amplify_video/1392867688175833091/vid/540x540/vO6IsCKzp_v_6uKr.mp4?tag=14', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1392867688175833091/pl/CWcaSNp0Hv3TUGsu.m3u8?tag=14', contentType='application/x-mpegURL', bitrate=None)], duration=90.023, views=73925, altText=None)]
3rd row[Video(thumbnailUrl='https://pbs.twimg.com/media/E0otrNjXsAQ5nqv.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1389972979266773003/vid/1280x720/9kiwOPeXmRy916Gw.mp4?tag=14', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/amplify_video/1389972979266773003/vid/480x270/BKJbU611BkJ_33MS.mp4?tag=14', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1389972979266773003/vid/640x360/rJ4Pay7ZvPMgKZDr.mp4?tag=14', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1389972979266773003/pl/VCbEC08PqbDlhBKY.m3u8?tag=14', contentType='application/x-mpegURL', bitrate=None)], duration=158.408, views=229353, altText=None)]
4th row[Video(thumbnailUrl='https://pbs.twimg.com/media/E0kjHjOWEAsliuC.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1389679079226585092/vid/320x320/4oN-gis9mm3QKnrR.mp4?tag=14', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1389679079226585092/pl/ss1X6opHbpFBZs0F.m3u8?tag=14', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1389679079226585092/vid/720x720/RKy3MXuX2OfO0t13.mp4?tag=14', contentType='video/mp4', bitrate=1280000), VideoVariant(url='https://video.twimg.com/amplify_video/1389679079226585092/vid/540x540/gvf5AAC_ZgLteVOI.mp4?tag=14', contentType='video/mp4', bitrate=832000)], duration=110.043, views=117524, altText=None)]
5th row[Photo(previewUrl='https://pbs.twimg.com/media/E0kFHuKXEAUPu2G?format=jpg&name=small', fullUrl='https://pbs.twimg.com/media/E0kFHuKXEAUPu2G?format=jpg&name=orig', altText=None)]

Common Values

ValueCountFrequency (%)
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1225775550448439296/img/OorzQZ8gL_TgPHFf.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/vid/640x360/pt6ZsDytiLWUMgQ5.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/vid/480x270/XGch5_MXP5VpOoT9.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/pl/IdL9UwWWcazfrbmB.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1225775550448439296/vid/1280x720/sxBjx6xvOH2xKrGj.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=3.462, views=43997, altText=None)] 47
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1302307323470319616/img/AqtjJsB1Rnndj79X.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/vid/480x270/TVSedxlTI706vPn_.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/pl/5zzHU49sHl9W5Zcg.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/vid/1280x720/nXn6oFl144kuH7p5.mp4?tag=13', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/amplify_video/1302307323470319616/vid/640x360/prGY9PcLBpjth_mR.mp4?tag=13', contentType='video/mp4', bitrate=832000)], duration=31.732, views=7583, altText=None)] 11
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1308723265947545602/img/fteOKUXbfnZvDxRM.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/vid/480x270/nadZGl6dQxqIiB8l.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/vid/640x360/trfnQBPDh4qTJSsE.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/pl/YGio6mjrClPvzMSu.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1308723265947545602/vid/1280x720/bEfK6coyBzlidmU7.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=30.155, views=18189, altText=None)] 11
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1293900872213291008/img/pX-vF3riIJsoeccY.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/vid/480x270/aYO1ydehwGPE24nz.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/pl/701H0265y9yCgEdl.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/vid/640x360/oBIlDzjqv9ENrv9s.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1293900872213291008/vid/1280x720/JyjhwZsi8ThCdIaC.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=30.03, views=14408, altText=None)] 11
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1255503615038545925/pu/img/TuxOaa5LgRhW-ISS.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/vid/1280x720/3VSQXmozy2Y9bo-q.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/vid/480x270/lEzMrdAqX0YCZRBD.mp4?tag=10', contentType='video/mp4', bitrate=256000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/vid/640x360/J0F_sb2gRFwNxeub.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1255503615038545925/pu/pl/QlXgy81fRppcPRgl.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None)], duration=55.722, views=28851, altText=None)] 11
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/media/EVA7uHnXkAUOEnv.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/480x480/r1EZCycTVfSAjGEf.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/320x320/9C-_LNOZ-mN0Vlfh.mp4?tag=13', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/pl/DxcPDK7x7Zvei8Mi.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/720x720/d1L-2RQmpxxlIqCx.mp4?tag=13', contentType='video/mp4', bitrate=1280000)], duration=74.408, views=40408, altText=None)] 11
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/media/EcSYCcAXgAA4-zx.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/vid/480x270/5T2ZZLMIomkbwH7R.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/vid/1280x720/nkrX8VfUjjUGynrI.mp4?tag=13', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/vid/640x360/bKT4UiEXRlTD4kz4.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/pl/MsZ3_olToB6_HhQ8.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=57.731, views=135655, altText=None)] 10
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1235997093333958657/pu/img/0BpzML29aKZkdWXi.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/640x360/nf_7qMpPmj5nqjHw.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/1280x720/UQ6Mm8r4hHpWEYcm.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/pl/wWezRq2ZFfIvL_vJ.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/480x270/pvByeCU5JfgklEDx.mp4?tag=10', contentType='video/mp4', bitrate=256000)], duration=3.462, views=15545, altText=None)] 10
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1265252962177335304/pu/img/MW152CDo1Gh5_EBC.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1265252962177335304/pu/vid/1280x720/eDflAuw3ixKJ47_5.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1265252962177335304/pu/pl/rOLaQ5O6iTFsX1ua.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1265252962177335304/pu/vid/640x360/U1PfOmh-wOW1Axaa.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1265252962177335304/pu/vid/480x270/wsPTWWyz72EBdTCX.mp4?tag=10', contentType='video/mp4', bitrate=256000)], duration=30.03, views=43460, altText=None)] 10
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1304031579111665670/img/3PNdVUzVL2YNFe5p.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1304031579111665670/vid/640x360/U_lLhm66gc06n3EJ.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1304031579111665670/vid/480x270/vIzLXbEO56qTdXso.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1304031579111665670/vid/1280x720/f61oaE8ZLGvnjmVe.mp4?tag=13', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/amplify_video/1304031579111665670/pl/H1lifZG6Whk2CGJ-.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=30.03, views=6843, altText=None)] 9
 
< 0.1%
Other values (108414) 111138
 
26.2%
(Missing) 312688
73.8%

mentionedUsers
Categorical

HIGH CARDINALITY  MISSING 

Distinct41533
Distinct (%)34.2%
Missing302624
Missing (%)71.4%
Memory size3.2 MiB
[User(username='POTUS', id=1349149096909668363, displayname='President Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
5732
[User(username='realDonaldTrump', id=25073877, displayname='Donald J. Trump', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
4077
[User(username='JoeBiden', id=939091, displayname='Joe Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
2325
[User(username='ossoff', id=521747968, displayname='Jon Ossoff', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
647
[User(username='WSJ', id=3108351, displayname='The Wall Street Journal', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
640
Other values (41528)
107922 

Unique

Unique31691 ?
Unique (%)26.1%

Sample

1st row[User(username='mjhegar', id=3021460584, displayname='MJ Hegar🇺🇦🌻', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
2nd row[User(username='HallieJackson', id=37590426, displayname='Hallie Jackson', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
3rd row[User(username='Booker4KY', id=3298708805, displayname='Charles Booker', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
4th row[User(username='Booker4KY', id=3298708805, displayname='Charles Booker', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
5th row[User(username='Booker4KY', id=3298708805, displayname='Charles Booker', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]

Common Values

ValueCountFrequency (%)
[User(username='POTUS', id=1349149096909668363, displayname='President Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 5732
 
1.4%
[User(username='realDonaldTrump', id=25073877, displayname='Donald J. Trump', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 4077
 
1.0%
[User(username='JoeBiden', id=939091, displayname='Joe Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 2325
 
0.5%
[User(username='ossoff', id=521747968, displayname='Jon Ossoff', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 647
 
0.2%
[User(username='WSJ', id=3108351, displayname='The Wall Street Journal', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 640
 
0.2%
[User(username='USPS', id=386507775, displayname='U.S. Postal Service', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 628
 
0.1%
[User(username='SBAgov', id=153149305, displayname='SBA', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 531
 
0.1%
[User(username='USDA', id=61853389, displayname='Dept. of Agriculture', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 527
 
0.1%
[User(username='SenateDems', id=73238146, displayname='Senate Democrats', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 511
 
0.1%
[User(username='CDCgov', id=146569971, displayname='CDC', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 473
 
0.1%
Other values (41523) 105252
 
24.8%
(Missing) 302624
71.4%

links
Categorical

HIGH CARDINALITY  MISSING 

Distinct135021
Distinct (%)99.4%
Missing288127
Missing (%)68.0%
Memory size3.2 MiB
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(125, 148))]
 
15
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(119, 142))]
 
14
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(185, 208))]
 
13
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(182, 205))]
 
11
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(118, 141))]
 
10
Other values (135016)
135777 

Unique

Unique134360 ?
Unique (%)98.9%

Sample

1st row[TextLink(text='msnbc.com/hallie-jackson…', url='https://www.msnbc.com/hallie-jackson/watch/sen-baldwin-gives-walkthrough-of-start-of-impeachment-trial-76703301596', tcourl='https://t.co/RheooZ2wAS', indices=(257, 280))]
2nd row[TextLink(text='twitter.com/washingtonpost…', url='https://twitter.com/washingtonpost/status/1392118564522450945', tcourl='https://t.co/1K852radFj', indices=(258, 281))]
3rd row[TextLink(text='pscp.tv/w/1YpKkzVleAdxj', url='https://www.pscp.tv/w/1YpKkzVleAdxj', tcourl='https://t.co/r5qSH6gLrQ', indices=(175, 198))]
4th row[TextLink(text='pscp.tv/w/c12eZzMyNzU3…', url='https://www.pscp.tv/w/c12eZzMyNzU3OTl8MVlwS2t6VmxlQWR4alUsEo3Yq3sZiCFRj5Jln2xi58FtnUQ-2Gv4cvFqMlXs', tcourl='https://t.co/TRy6ZqH77Y', indices=(173, 196))]
5th row[TextLink(text='live.berniesanders.com', url='http://live.berniesanders.com', tcourl='https://t.co/vreIiWfeoS', indices=(157, 180))]

Common Values

ValueCountFrequency (%)
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(125, 148))] 15
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(119, 142))] 14
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(185, 208))] 13
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(182, 205))] 11
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(118, 141))] 10
 
< 0.1%
[TextLink(text='edmarkey.com', url='http://edmarkey.com', tcourl='https://t.co/qoQeZe2aqv', indices=(228, 251))] 7
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(177, 200))] 6
 
< 0.1%
[TextLink(text='rubio.senate.gov/public/_cache/…', url='https://www.rubio.senate.gov/public/_cache/files/20e74c90-ac60-4227-a7fd-a50c5a429481/D10C63C7653FABA5F5F5EC74FF4E2B21.usc-abroad-help.pdf', tcourl='https://t.co/eobqWCTM56', indices=(228, 251))] 6
 
< 0.1%
[TextLink(text='ronwis.co/ronsroadtour', url='http://ronwis.co/ronsroadtour', tcourl='https://t.co/VeOasvEWkH', indices=(69, 92))] 5
 
< 0.1%
[TextLink(text='rubio.senate.gov/public/_cache/…', url='https://www.rubio.senate.gov/public/_cache/files/3aba21e8-3fb3-4844-a217-1e8fb3334e93/C8D670B7E3FF7FEFAE1B749369D55054.paycheck-protection-program-faqs-for-small-businesses-in-spanish-final.pdf', tcourl='https://t.co/Uyfcm5SOdQ', indices=(256, 279))] 5
 
< 0.1%
Other values (135011) 135748
32.0%
(Missing) 288127
68.0%

place
Categorical

HIGH CARDINALITY  MISSING 

Distinct199
Distinct (%)24.8%
Missing423163
Missing (%)99.8%
Memory size3.2 MiB
Place(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US')
243 
Place(id='7d893ca2441b0c21', fullName='North Dakota, USA', name='North Dakota', type='admin', country='United States', countryCode='US')
 
32
Place(id='3cd12646c811c87e', fullName='Bismarck, ND', name='Bismarck', type='city', country='United States', countryCode='US')
 
30
Place(id='71f2805dd75bc147', fullName='Charleston, WV', name='Charleston', type='city', country='United States', countryCode='US')
 
22
Place(id='3f5897b87d2bf56c', fullName='Delaware, USA', name='Delaware', type='admin', country='United States', countryCode='US')
 
19
Other values (194)
458 

Unique

Unique111 ?
Unique (%)13.8%

Sample

1st rowPlace(id='7dc5c6d3bfb10ccc', fullName='Wisconsin, USA', name='Wisconsin', type='admin', country='United States', countryCode='US')
2nd rowPlace(id='7dc5c6d3bfb10ccc', fullName='Wisconsin, USA', name='Wisconsin', type='admin', country='United States', countryCode='US')
3rd rowPlace(id='e06ed4324b139bf2', fullName='Cedar Rapids, IA', name='Cedar Rapids', type='city', country='United States', countryCode='US')
4th rowPlace(id='0af98038b97a9d57', fullName='Toledo, IA', name='Toledo', type='city', country='United States', countryCode='US')
5th rowPlace(id='2c448dd9c022d778', fullName='Albia, IA', name='Albia', type='city', country='United States', countryCode='US')

Common Values

ValueCountFrequency (%)
Place(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US') 243
 
0.1%
Place(id='7d893ca2441b0c21', fullName='North Dakota, USA', name='North Dakota', type='admin', country='United States', countryCode='US') 32
 
< 0.1%
Place(id='3cd12646c811c87e', fullName='Bismarck, ND', name='Bismarck', type='city', country='United States', countryCode='US') 30
 
< 0.1%
Place(id='71f2805dd75bc147', fullName='Charleston, WV', name='Charleston', type='city', country='United States', countryCode='US') 22
 
< 0.1%
Place(id='3f5897b87d2bf56c', fullName='Delaware, USA', name='Delaware', type='admin', country='United States', countryCode='US') 19
 
< 0.1%
Place(id='1c67f9d9cbae7f69', fullName='Des Moines, IA', name='Des Moines', type='city', country='United States', countryCode='US') 17
 
< 0.1%
Place(id='27c45d804c777999', fullName='Kansas, USA', name='Kansas', type='admin', country='United States', countryCode='US') 16
 
< 0.1%
Place(id='e08aaac2b23fd3a3', fullName='Gallatin, TN', name='Gallatin', type='city', country='United States', countryCode='US') 12
 
< 0.1%
Place(id='2d83c71ce16cd187', fullName='West Virginia, USA', name='West Virginia', type='admin', country='United States', countryCode='US') 12
 
< 0.1%
Place(id='319ee7b36c9149da', fullName='Arlington, VA', name='Arlington', type='city', country='United States', countryCode='US') 11
 
< 0.1%
Other values (189) 390
 
0.1%
(Missing) 423163
99.8%

quotedTweet
Categorical

HIGH CARDINALITY  MISSING 

Distinct57466
Distinct (%)88.7%
Missing359210
Missing (%)84.7%
Memory size3.2 MiB
https://twitter.com/SenatorLujan/status/1488618483017523205
 
37
https://twitter.com/SenatorEnzi/status/1419888831671635972
 
28
https://twitter.com/realDonaldTrump/status/1311892190680014849
 
27
https://twitter.com/SenatorRounds/status/1455641509928312840
 
24
https://twitter.com/NASA/status/1231954422785363968
 
20
Other values (57461)
64621 

Unique

Unique52932 ?
Unique (%)81.7%

Sample

1st rowhttps://twitter.com/mjhegar/status/1226869637255716865
2nd rowhttps://twitter.com/OutFrontCNN/status/1216881777265999874
3rd rowhttps://twitter.com/fightfor15/status/1392837114400690181
4th rowhttps://twitter.com/washingtonpost/status/1392118564522450945
5th rowhttps://twitter.com/CivicYouth/status/1387776959590043650

Common Values

ValueCountFrequency (%)
https://twitter.com/SenatorLujan/status/1488618483017523205 37
 
< 0.1%
https://twitter.com/SenatorEnzi/status/1419888831671635972 28
 
< 0.1%
https://twitter.com/realDonaldTrump/status/1311892190680014849 27
 
< 0.1%
https://twitter.com/SenatorRounds/status/1455641509928312840 24
 
< 0.1%
https://twitter.com/NASA/status/1231954422785363968 20
 
< 0.1%
https://twitter.com/USNavy/status/1448279894719438854 19
 
< 0.1%
https://twitter.com/POTUS/status/1562462774969581570 19
 
< 0.1%
https://twitter.com/ChrisVanHollen/status/1526027910783680513 17
 
< 0.1%
https://twitter.com/USMC/status/1326168775188766721 16
 
< 0.1%
https://twitter.com/politico/status/1521288272021901312 15
 
< 0.1%
Other values (57456) 64535
 
15.2%
(Missing) 359210
84.7%

retweetedTweet
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing423967
Missing (%)100.0%
Memory size3.2 MiB

sourceLabel
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing423967
Missing (%)100.0%
Memory size3.2 MiB

sourceUrl
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing423967
Missing (%)100.0%
Memory size3.2 MiB

url
Categorical

Distinct423612
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
https://twitter.com/CoryGardner/status/1295483839565856768
 
2
https://twitter.com/CoryGardner/status/1234868300884000771
 
2
https://twitter.com/CoryGardner/status/1232779505279938561
 
2
https://twitter.com/CoryGardner/status/1233091991015055362
 
2
https://twitter.com/CoryGardner/status/1233106983936516096
 
2
Other values (423607)
423957 

Unique

Unique423257 ?
Unique (%)99.8%

Sample

1st rowhttps://twitter.com/tammybaldwin/status/1226988645044891649
2nd rowhttps://twitter.com/tammybaldwin/status/1217185789156777986
3rd rowhttps://twitter.com/tammybaldwin/status/1216891845692837888
4th rowhttps://twitter.com/BernieSanders/status/1394083996653563907
5th rowhttps://twitter.com/BernieSanders/status/1393972598237696007

Common Values

ValueCountFrequency (%)
https://twitter.com/CoryGardner/status/1295483839565856768 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1234868300884000771 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1232779505279938561 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1233091991015055362 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1233106983936516096 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1233172655060156416 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1233492502910570496 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1233502239416815619 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1233903840358887424 2
 
< 0.1%
https://twitter.com/CoryGardner/status/1234546239166799872 2
 
< 0.1%
Other values (423602) 423947
> 99.9%

date
Categorical

Distinct398624
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
2021-07-15 12:08:59+00:00
 
6
2021-05-11 22:11:42+00:00
 
6
2021-01-19 15:27:52+00:00
 
5
2020-05-01 23:30:35+00:00
 
5
2020-05-19 16:24:02+00:00
 
5
Other values (398619)
423940 

Unique

Unique377653 ?
Unique (%)89.1%

Sample

1st row2020-02-10 21:57:43+00:00
2nd row2020-01-14 20:44:41+00:00
3rd row2020-01-14 01:16:39+00:00
4th row2021-05-17 00:15:00+00:00
5th row2021-05-16 16:52:20+00:00

Common Values

ValueCountFrequency (%)
2021-07-15 12:08:59+00:00 6
 
< 0.1%
2021-05-11 22:11:42+00:00 6
 
< 0.1%
2021-01-19 15:27:52+00:00 5
 
< 0.1%
2020-05-01 23:30:35+00:00 5
 
< 0.1%
2020-05-19 16:24:02+00:00 5
 
< 0.1%
2020-10-27 00:38:00+00:00 5
 
< 0.1%
2020-05-16 15:40:58+00:00 5
 
< 0.1%
2020-11-04 02:10:50+00:00 5
 
< 0.1%
2020-09-16 20:06:26+00:00 5
 
< 0.1%
2021-07-14 14:19:09+00:00 5
 
< 0.1%
Other values (398614) 423915
> 99.9%

replyCount
Categorical

Distinct6433
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
1
 
17320
2
 
16967
3
 
16151
4
 
15258
5
 
14121
Other values (6428)
344150 

Unique

Unique2901 ?
Unique (%)0.7%

Sample

1st row0
2nd row13
3rd row11
4th row509
5th row10672

Common Values

ValueCountFrequency (%)
1 17320
 
4.1%
2 16967
 
4.0%
3 16151
 
3.8%
4 15258
 
3.6%
5 14121
 
3.3%
6 12750
 
3.0%
7 11775
 
2.8%
8 10902
 
2.6%
0 9954
 
2.3%
9 9909
 
2.3%
Other values (6423) 288860
68.1%

retweetCount
Categorical

Distinct8781
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
3
 
19149
2
 
18415
4
 
18308
5
 
16424
6
 
14641
Other values (8776)
337030 

Unique

Unique3580 ?
Unique (%)0.8%

Sample

1st row21
2nd row8
3rd row9
4th row1291
5th row34549

Common Values

ValueCountFrequency (%)
3 19149
 
4.5%
2 18415
 
4.3%
4 18308
 
4.3%
5 16424
 
3.9%
6 14641
 
3.5%
1 14188
 
3.3%
7 13019
 
3.1%
8 11493
 
2.7%
9 10128
 
2.4%
10 8565
 
2.0%
Other values (8771) 279637
66.0%

likeCount
Categorical

Distinct22039
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
9
 
4759
8
 
4718
10
 
4694
11
 
4653
12
 
4618
Other values (22034)
400525 

Unique

Unique10908 ?
Unique (%)2.6%

Sample

1st row74
2nd row22
3rd row68
4th row6564
5th row151031

Common Values

ValueCountFrequency (%)
9 4759
 
1.1%
8 4718
 
1.1%
10 4694
 
1.1%
11 4653
 
1.1%
12 4618
 
1.1%
7 4579
 
1.1%
13 4483
 
1.1%
14 4394
 
1.0%
6 4362
 
1.0%
15 4264
 
1.0%
Other values (22029) 378443
89.3%

quoteCount
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct2673
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
0
101761 
1
67478 
2
41844 
3
27332 
4
19573 
Other values (2668)
165979 

Unique

Unique1136 ?
Unique (%)0.3%

Sample

1st row1
2nd row0
3rd row1
4th row83
5th row3344

Common Values

ValueCountFrequency (%)
0 101761
24.0%
1 67478
15.9%
2 41844
 
9.9%
3 27332
 
6.4%
4 19573
 
4.6%
5 14944
 
3.5%
6 11681
 
2.8%
7 9489
 
2.2%
8 7906
 
1.9%
9 6777
 
1.6%
Other values (2663) 115182
27.2%

conversationId
Categorical

Distinct370501
Distinct (%)87.4%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
1606376499900166145
 
176
1408239786628849668
 
92
1560436753105711104
 
80
1606308003740667905
 
56
1253160205912768519
 
49
Other values (370496)
423514 

Unique

Unique339421 ?
Unique (%)80.1%

Sample

1st row1226988645044891649
2nd row1217185789156777986
3rd row1216891845692837888
4th row1394083996653563907
5th row1393972598237696007

Common Values

ValueCountFrequency (%)
1606376499900166145 176
 
< 0.1%
1408239786628849668 92
 
< 0.1%
1560436753105711104 80
 
< 0.1%
1606308003740667905 56
 
< 0.1%
1253160205912768519 49
 
< 0.1%
1310293038967787520 47
 
< 0.1%
1304080286679019521 42
 
< 0.1%
1334182615939772417 37
 
< 0.1%
1312818478835994624 36
 
< 0.1%
1341812156640227329 35
 
< 0.1%
Other values (370491) 423317
99.8%

lang
Categorical

Distinct41
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
en
414556 
es
 
3288
zxx
 
2569
art
 
962
qme
 
692
Other values (36)
 
1900

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowen
2nd rowen
3rd rowen
4th rowen
5th rowen

Common Values

ValueCountFrequency (%)
en 414556
97.8%
es 3288
 
0.8%
zxx 2569
 
0.6%
art 962
 
0.2%
qme 692
 
0.2%
und 359
 
0.1%
qst 338
 
0.1%
qht 286
 
0.1%
fr 217
 
0.1%
qam 88
 
< 0.1%
Other values (31) 612
 
0.1%

source
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing423967
Missing (%)100.0%
Memory size3.2 MiB

contains_keyword
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct1170
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
none
374940 
pandemic
 
12092
ppe
 
9306
China
 
5505
corona
 
4038
Other values (1165)
 
18086

Unique

Unique681 ?
Unique (%)0.2%

Sample

1st rownone
2nd rownone
3rd rownone
4th rownone
5th rownone

Common Values

ValueCountFrequency (%)
none 374940
88.4%
pandemic 12092
 
2.9%
ppe 9306
 
2.2%
China 5505
 
1.3%
corona 4038
 
1.0%
vaccine 3709
 
0.9%
Coronavirus 1143
 
0.3%
corona,pandemic 1042
 
0.2%
CDC 908
 
0.2%
China,China 899
 
0.2%
Other values (1160) 10385
 
2.4%
\ No newline at end of file diff --git a/data/OUT/profiles/CovTweets.html b/data/OUT/profiles/CovTweets.html new file mode 100644 index 0000000..2805b72 --- /dev/null +++ b/data/OUT/profiles/CovTweets.html @@ -0,0 +1,5086 @@ +Pandas Profiling Report

Overview

Dataset statistics

Number of variables52
Number of observations49886
Missing cells350804
Missing cells (%)13.5%
Total size in memory19.8 MiB
Average record size in memory416.0 B

Variable types

Categorical52

Alerts

contains_keyword has constant value "True"Constant
id has a high cardinality: 49886 distinct valuesHigh cardinality
tid has a high cardinality: 49856 distinct valuesHigh cardinality
index_x has a high cardinality: 49856 distinct valuesHigh cardinality
user.id has a high cardinality: 167 distinct valuesHigh cardinality
user.username has a high cardinality: 167 distinct valuesHigh cardinality
user.created has a high cardinality: 167 distinct valuesHigh cardinality
user.favouritesCount has a high cardinality: 157 distinct valuesHigh cardinality
user.followersCount has a high cardinality: 183 distinct valuesHigh cardinality
user.friendsCount has a high cardinality: 162 distinct valuesHigh cardinality
rawContent has a high cardinality: 49684 distinct valuesHigh cardinality
inReplyToTweetId has a high cardinality: 5102 distinct valuesHigh cardinality
inReplyToUser has a high cardinality: 209 distinct valuesHigh cardinality
media has a high cardinality: 11168 distinct valuesHigh cardinality
mentionedUsers has a high cardinality: 6517 distinct valuesHigh cardinality
links has a high cardinality: 19761 distinct valuesHigh cardinality
quotedTweet has a high cardinality: 6010 distinct valuesHigh cardinality
url has a high cardinality: 49841 distinct valuesHigh cardinality
date has a high cardinality: 48359 distinct valuesHigh cardinality
replyCount has a high cardinality: 2208 distinct valuesHigh cardinality
retweetCount has a high cardinality: 3351 distinct valuesHigh cardinality
likeCount has a high cardinality: 6330 distinct valuesHigh cardinality
quoteCount has a high cardinality: 1042 distinct valuesHigh cardinality
conversationId has a high cardinality: 46570 distinct valuesHigh cardinality
keywords has a high cardinality: 1734 distinct valuesHigh cardinality
level_0 has a high cardinality: 109 distinct valuesHigh cardinality
index_y has a high cardinality: 109 distinct valuesHigh cardinality
name has a high cardinality: 109 distinct valuesHigh cardinality
id.1 has a high cardinality: 109 distinct valuesHigh cardinality
ideology has a high cardinality: 108 distinct valuesHigh cardinality
vote_share has a high cardinality: 96 distinct valuesHigh cardinality
next_closest_share has a high cardinality: 89 distinct valuesHigh cardinality
alt_handle has a high cardinality: 68 distinct valuesHigh cardinality
date_of_birth has a high cardinality: 109 distinct valuesHigh cardinality
edu_information has a high cardinality: 109 distinct valuesHigh cardinality
twitter_handle has a high cardinality: 62 distinct valuesHigh cardinality
tweetLen has a high cardinality: 318 distinct valuesHigh cardinality
quoteCount is highly imbalanced (52.5%)Imbalance
keywords is highly imbalanced (53.7%)Imbalance
end_serving is highly imbalanced (82.1%)Imbalance
not_in_office is highly imbalanced (50.3%)Imbalance
last_congress is highly imbalanced (68.6%)Imbalance
ethnicity is highly imbalanced (63.2%)Imbalance
inReplyToTweetId has 44783 (89.8%) missing valuesMissing
inReplyToUser has 44783 (89.8%) missing valuesMissing
media has 38446 (77.1%) missing valuesMissing
mentionedUsers has 34786 (69.7%) missing valuesMissing
links has 29901 (59.9%) missing valuesMissing
place has 49853 (99.9%) missing valuesMissing
quotedTweet has 43370 (86.9%) missing valuesMissing
vote_share has 1460 (2.9%) missing valuesMissing
next_closest_share has 1460 (2.9%) missing valuesMissing
alt_handle has 22882 (45.9%) missing valuesMissing
twitter_handle has 39080 (78.3%) missing valuesMissing
id has unique valuesUnique

Reproduction

Analysis started2023-08-08 12:58:40.494734
Analysis finished2023-08-08 12:58:41.714468
Duration1.22 second
Software versionpandas-profiling v3.6.6
Download configurationconfig.json

Variables

id
Categorical

HIGH CARDINALITY  UNIQUE 

Distinct49886
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
0
 
1
33273
 
1
33251
 
1
33252
 
1
33253
 
1
Other values (49881)
49881 

Unique

Unique49886 ?
Unique (%)100.0%

Sample

1st row0
2nd row1
3rd row2
4th row3
5th row4

Common Values

ValueCountFrequency (%)
0 1
 
< 0.1%
33273 1
 
< 0.1%
33251 1
 
< 0.1%
33252 1
 
< 0.1%
33253 1
 
< 0.1%
33254 1
 
< 0.1%
33255 1
 
< 0.1%
33256 1
 
< 0.1%
33257 1
 
< 0.1%
33258 1
 
< 0.1%
Other values (49876) 49876
> 99.9%

tid
Categorical

Distinct49856
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
134633
 
2
132331
 
2
130314
 
2
44507
 
2
44506
 
2
Other values (49851)
49876 

Unique

Unique49826 ?
Unique (%)99.9%

Sample

1st row321
2nd row390
3rd row720
4th row1342
5th row2662

Common Values

ValueCountFrequency (%)
134633 2
 
< 0.1%
132331 2
 
< 0.1%
130314 2
 
< 0.1%
44507 2
 
< 0.1%
44506 2
 
< 0.1%
45746 2
 
< 0.1%
45747 2
 
< 0.1%
132349 2
 
< 0.1%
128516 2
 
< 0.1%
128515 2
 
< 0.1%
Other values (49846) 49866
> 99.9%

index_x
Categorical

Distinct49856
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
47621
 
2
47643
 
2
47658
 
2
262883
 
2
42817
 
2
Other values (49851)
49876 

Unique

Unique49826 ?
Unique (%)99.9%

Sample

1st row195510
2nd row112379
3rd row372950
4th row224601
5th row241535

Common Values

ValueCountFrequency (%)
47621 2
 
< 0.1%
47643 2
 
< 0.1%
47658 2
 
< 0.1%
262883 2
 
< 0.1%
42817 2
 
< 0.1%
42815 2
 
< 0.1%
262881 2
 
< 0.1%
139352 2
 
< 0.1%
47679 2
 
< 0.1%
139390 2
 
< 0.1%
Other values (49846) 49866
> 99.9%

user.id
Categorical

Distinct167
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
293131808
 
1503
109287731
 
1314
13218102
 
1157
247334603
 
1111
818554054309715969
 
959
Other values (162)
43842 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row22195441
2nd row33537967
3rd row22044727
4th row278124059
5th row818554054309715969

Common Values

ValueCountFrequency (%)
293131808 1503
 
3.0%
109287731 1314
 
2.6%
13218102 1157
 
2.3%
247334603 1111
 
2.2%
818554054309715969 959
 
1.9%
18695134 838
 
1.7%
18915145 832
 
1.7%
970207298 822
 
1.6%
17494010 782
 
1.6%
811313565760163844 756
 
1.5%
Other values (157) 39812
79.8%

user.username
Categorical

Distinct167
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
pattymurray
 
1503
senatorshaheen
 
1314
johncornyn
 
1157
senatordurbin
 
1111
senjackyrosen
 
959
Other values (162)
43842 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowthomtillis
2nd rowamyklobuchar
3rd rowsenamyklobuchar
4th rowsenblumenthal
5th rowsenjackyrosen

Common Values

ValueCountFrequency (%)
pattymurray 1503
 
3.0%
senatorshaheen 1314
 
2.6%
johncornyn 1157
 
2.3%
senatordurbin 1111
 
2.2%
senjackyrosen 959
 
1.9%
senatormenendez 838
 
1.7%
senrobportman 832
 
1.7%
senwarren 822
 
1.6%
senschumer 782
 
1.6%
sencortezmasto 756
 
1.5%
Other values (157) 39812
79.8%

user.created
Categorical

Distinct167
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
2011-05-04 20:23:35+00:00
 
1503
2010-01-28 15:22:44+00:00
 
1314
2008-02-07 19:52:55+00:00
 
1157
2011-02-04 15:50:42+00:00
 
1111
2017-01-09 20:24:29+00:00
 
959
Other values (162)
43842 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row2009-02-27 21:49:53+00:00
2nd row2009-04-20 14:59:36+00:00
3rd row2009-02-26 18:50:05+00:00
4th row2011-04-06 17:13:53+00:00
5th row2017-01-09 20:24:29+00:00

Common Values

ValueCountFrequency (%)
2011-05-04 20:23:35+00:00 1503
 
3.0%
2010-01-28 15:22:44+00:00 1314
 
2.6%
2008-02-07 19:52:55+00:00 1157
 
2.3%
2011-02-04 15:50:42+00:00 1111
 
2.2%
2017-01-09 20:24:29+00:00 959
 
1.9%
2009-01-06 21:01:12+00:00 838
 
1.7%
2009-01-12 20:56:42+00:00 832
 
1.7%
2012-11-25 15:14:27+00:00 822
 
1.6%
2008-11-19 20:10:20+00:00 782
 
1.6%
2016-12-20 20:53:22+00:00 756
 
1.5%
Other values (157) 39812
79.8%
Distinct157
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
510
 
1545
311
 
1503
1568
 
1343
10678
 
1157
835
 
1111
Other values (152)
43227 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row408
2nd row18
3rd row47
4th row1406
5th row732

Common Values

ValueCountFrequency (%)
510 1545
 
3.1%
311 1503
 
3.0%
1568 1343
 
2.7%
10678 1157
 
2.3%
835 1111
 
2.2%
732 959
 
1.9%
3742 838
 
1.7%
467 832
 
1.7%
25 822
 
1.6%
1724 756
 
1.5%
Other values (147) 39020
78.2%
Distinct183
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
490598
 
1503
129843
 
1314
369869
 
1157
739969
 
1111
50827
 
959
Other values (178)
43842 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row40435
2nd row2044344
3rd row96125
4th row709072
5th row50827

Common Values

ValueCountFrequency (%)
490598 1503
 
3.0%
129843 1314
 
2.6%
369869 1157
 
2.3%
739969 1111
 
2.2%
50827 959
 
1.9%
217777 838
 
1.7%
168873 832
 
1.7%
6986509 822
 
1.6%
75483 756
 
1.5%
199036 754
 
1.5%
Other values (173) 39840
79.9%
Distinct162
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
2403
 
1503
2822
 
1314
13109
 
1157
2418
 
1111
817
 
959
Other values (157)
43842 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row5615
2nd row124444
3rd row4778
4th row982
5th row817

Common Values

ValueCountFrequency (%)
2403 1503
 
3.0%
2822 1314
 
2.6%
13109 1157
 
2.3%
2418 1111
 
2.2%
817 959
 
1.9%
544 906
 
1.8%
1058 838
 
1.7%
4994 832
 
1.7%
508 822
 
1.6%
23160 782
 
1.6%
Other values (152) 39662
79.5%

rawContent
Categorical

Distinct49684
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
Are you a Floridian seeking a #COVID19 vaccine? 💉 @HealthyFla provides a complete list of vaccination sites in #Florida. 👇 https://t.co/h4jdZoPS1z
 
15
¿Es usted un floridiano buscando en dónde ponerse la vacuna contra el #COVID19? 💉 @HealthyFla le ofrece una lista en inglés de las sedes de vacunación en el estado de la #Florida. 👇 https://t.co/h4jdZoPS1z
 
12
¿Es usted un floridiano buscando dónde poder ponerse la vacuna contra el #COVID19? 👀 Aquí encontrará una lista en inglés de las sedes de vacunación en el estado de la #Florida. 👇 https://t.co/h4jdZoPS1z
 
10
Are you a Floridian seeking a #COVID19 vaccine? 👀 A complete list of #Florida vaccination sites can be found here. 👇 https://t.co/h4jdZoPS1z
 
9
Wear a mask.
 
9
Other values (49679)
49831 

Unique

Unique49594 ?
Unique (%)99.4%

Sample

1st rowEven my dog Mitch is tired of the political games Pelosi and Schumer are playing when it comes to impeachment. https://t.co/BcDCbyuCk3
2nd rowbreaking: Duffey Email says: “Clear direction from POTUS to continue to hold” aid to Ukraine. We need him as a witness to testify and Mitch McConnell has no excuse. https://t.co/Awn955wZ8v
3rd row505 people have been hospitalized from #flu in MN this season &amp; eight people have died. Every year #fluvaccine prevents millions of illnesses, 10s of thousands of hospitalizations, and thousands of deaths. Here's where to get vaccinated in MN to #fightflu: https://t.co/ktfnSM9BqL https://t.co/aozD17TzMO
4th rowDeeply concerning issues—beyond MCAS software—require new Boeing CEO Calhoun to come clean on all facts. His credibility is at stake. Full transparency to the public &amp; FAA. https://t.co/wfhngnB0g1
5th rowIt was my honor to give opening remarks at today's U.S. Commission on International Religious Freedom hearing on the need to combat anti-Semitism. I launched the Senate Bipartisan Task Force for Combating Anti-Semitism because this epidemic must be taken seriously. https://t.co/vgGBjawI4U

Common Values

ValueCountFrequency (%)
Are you a Floridian seeking a #COVID19 vaccine? 💉 @HealthyFla provides a complete list of vaccination sites in #Florida. 👇 https://t.co/h4jdZoPS1z 15
 
< 0.1%
¿Es usted un floridiano buscando en dónde ponerse la vacuna contra el #COVID19? 💉 @HealthyFla le ofrece una lista en inglés de las sedes de vacunación en el estado de la #Florida. 👇 https://t.co/h4jdZoPS1z 12
 
< 0.1%
¿Es usted un floridiano buscando dónde poder ponerse la vacuna contra el #COVID19? 👀 Aquí encontrará una lista en inglés de las sedes de vacunación en el estado de la #Florida. 👇 https://t.co/h4jdZoPS1z 10
 
< 0.1%
Are you a Floridian seeking a #COVID19 vaccine? 👀 A complete list of #Florida vaccination sites can be found here. 👇 https://t.co/h4jdZoPS1z 9
 
< 0.1%
Wear a mask. 9
 
< 0.1%
¿Es usted un floridiano buscando en dónde ponerse la vacuna contra el #COVID19? 💉 Aquí encontrará una lista en inglés de las sedes de vacunación en el estado de la #Florida 👇 https://t.co/h4jdZoPS1z 6
 
< 0.1%
Are you a Floridian seeking a #COVID19 vaccine? 💉 A complete list of #Florida vaccination sites can be found here👇 https://t.co/h4jdZoPS1z 6
 
< 0.1%
Are you a Floridian seeking a #COVID19 vaccine? 💉 A complete list of #Florida vaccination sites can be found here 👇 https://t.co/h4jdZoPS1z 5
 
< 0.1%
You are either helping right now, or hurting. Pelosi’s handpicked partisan ploys are helping no one. She’s hurting American families by prioritizing airplane emissions, same day voter registration&amp;wind energy tax credits over #coronavirus relief. #StopTheGamesNancy, #PassTheBill 5
 
< 0.1%
There’s a lot of information coming out about #COVID19. It’s important that West Virginians stay informed, and that’s why I’m sending out a daily coronavirus newsletter. Subscribe here 📧 https://t.co/cbYJ0VvRAX 5
 
< 0.1%
Other values (49674) 49804
99.8%

inReplyToTweetId
Categorical

HIGH CARDINALITY  MISSING 

Distinct5102
Distinct (%)> 99.9%
Missing44783
Missing (%)89.8%
Memory size389.9 KiB
1.2410988044152259e+18
 
2
1.3435600653836e+18
 
1
1.3415805666726584e+18
 
1
1.3415449948349235e+18
 
1
1.3415302064152945e+18
 
1
Other values (5097)
5097 

Unique

Unique5101 ?
Unique (%)> 99.9%

Sample

1st row1.2150523967889654e+18
2nd row1.2169090836531036e+18
3rd row1.217892151029162e+18
4th row1.219973311473574e+18
5th row1.2199733106305638e+18

Common Values

ValueCountFrequency (%)
1.2410988044152259e+18 2
 
< 0.1%
1.3435600653836e+18 1
 
< 0.1%
1.3415805666726584e+18 1
 
< 0.1%
1.3415449948349235e+18 1
 
< 0.1%
1.3415302064152945e+18 1
 
< 0.1%
1.3415065604938383e+18 1
 
< 0.1%
1.3414741242045317e+18 1
 
< 0.1%
1.3414661572163912e+18 1
 
< 0.1%
1.3414661586172928e+18 1
 
< 0.1%
1.3414465894502072e+18 1
 
< 0.1%
Other values (5092) 5092
 
10.2%
(Missing) 44783
89.8%

inReplyToUser
Categorical

HIGH CARDINALITY  MISSING 

Distinct209
Distinct (%)4.1%
Missing44783
Missing (%)89.8%
Memory size389.9 KiB
https://twitter.com/lisamurkowski
 
248
https://twitter.com/SenWarren
 
210
https://twitter.com/SenBobCasey
 
193
https://twitter.com/SenatorMenendez
 
181
https://twitter.com/PattyMurray
 
180
Other values (204)
4091 

Unique

Unique56 ?
Unique (%)1.1%

Sample

1st rowhttps://twitter.com/ChrisCoons
2nd rowhttps://twitter.com/JohnBoozman
3rd rowhttps://twitter.com/DanielCoronaNV
4th rowhttps://twitter.com/SenTomCotton
5th rowhttps://twitter.com/SenTomCotton

Common Values

ValueCountFrequency (%)
https://twitter.com/lisamurkowski 248
 
0.5%
https://twitter.com/SenWarren 210
 
0.4%
https://twitter.com/SenBobCasey 193
 
0.4%
https://twitter.com/SenatorMenendez 181
 
0.4%
https://twitter.com/PattyMurray 180
 
0.4%
https://twitter.com/ChrisCoons 173
 
0.3%
https://twitter.com/SenatorDurbin 144
 
0.3%
https://twitter.com/SenCortezMasto 123
 
0.2%
https://twitter.com/SenBlumenthal 117
 
0.2%
https://twitter.com/SenatorCarper 105
 
0.2%
Other values (199) 3429
 
6.9%
(Missing) 44783
89.8%

media
Categorical

HIGH CARDINALITY  MISSING 

Distinct11168
Distinct (%)97.6%
Missing38446
Missing (%)77.1%
Memory size389.9 KiB
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1291002994100764674/img/i8TuA2AeaYPTCYFY.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/vid/480x270/dKbXcyMtkCN7GZml.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/pl/eqnXROeHFTWNXoIu.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/vid/640x360/tZaWUqn6FFiCgi6q.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/vid/1280x720/7MrfPW9eyM0n7xTW.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=30.072, views=3718, altText=None)]
 
7
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1327008380356091905/img/97aoeLoFlW1b4aYx.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/vid/720x720/traeLGSvu9PukFMo.mp4?tag=13', contentType='video/mp4', bitrate=1280000), VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/vid/320x320/bvzIb7_YQo6-EZME.mp4?tag=13', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/vid/480x480/_dnlxugOwYx3NXSQ.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/pl/JgoWPW8L8zRPYnvn.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=567.4, views=64389, altText=None)]
 
6
[Video(thumbnailUrl='https://pbs.twimg.com/media/EVA7uHnXkAUOEnv.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/480x480/r1EZCycTVfSAjGEf.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/320x320/9C-_LNOZ-mN0Vlfh.mp4?tag=13', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/pl/DxcPDK7x7Zvei8Mi.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/720x720/d1L-2RQmpxxlIqCx.mp4?tag=13', contentType='video/mp4', bitrate=1280000)], duration=74.408, views=40408, altText=None)]
 
6
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1235997093333958657/pu/img/0BpzML29aKZkdWXi.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/640x360/nf_7qMpPmj5nqjHw.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/1280x720/UQ6Mm8r4hHpWEYcm.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/pl/wWezRq2ZFfIvL_vJ.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/480x270/pvByeCU5JfgklEDx.mp4?tag=10', contentType='video/mp4', bitrate=256000)], duration=3.462, views=15545, altText=None)]
 
6
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1294381698734596101/img/PrEMwcC427fnXn7Z.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1294381698734596101/vid/506x282/RGtuEW9iTI8oH9hf.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1294381698734596101/pl/wvfTknqHtjj9pNUP.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=6.649, views=1099, altText=None)]
 
5
Other values (11163)
11410 

Unique

Unique10965 ?
Unique (%)95.8%

Sample

1st row[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1212787857150169088/pu/img/fzVsYPHqjGmQ4M_G.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1212787857150169088/pu/vid/540x960/oNCWznSbf-IwYuEw.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1212787857150169088/pu/pl/ymExlk43ys0C-XWF.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1212787857150169088/pu/vid/320x568/lrBVTPBEFPS8raig.mp4?tag=10', contentType='video/mp4', bitrate=632000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1212787857150169088/pu/vid/360x640/YLXm9aTEOgiZO0Hl.mp4?tag=10', contentType='video/mp4', bitrate=832000)], duration=74.388, views=7249, altText=None)]
2nd row[Gif(thumbnailUrl='https://pbs.twimg.com/tweet_video_thumb/ENXy3zdWsAAzNOJ.jpg', variants=[VideoVariant(url='https://video.twimg.com/tweet_video/ENXy3zdWsAAzNOJ.mp4', contentType='video/mp4', bitrate=0)], altText=None)]
3rd row[Video(thumbnailUrl='https://pbs.twimg.com/media/ENy2CgeW4AArzJG.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1215041466730532867/vid/480x270/coFsvYec26kPotw_.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1215041466730532867/pl/EsQeeym3RZ9LBupY.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1215041466730532867/vid/640x360/ozCuiQ4ah9VFZnLu.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1215041466730532867/vid/1280x720/Ct93aoYSzy1l4j0c.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=494.06, views=779, altText=None)]
4th row[Photo(previewUrl='https://pbs.twimg.com/media/EOX5aaXW4AATn2U?format=jpg&name=small', fullUrl='https://pbs.twimg.com/media/EOX5aaXW4AATn2U?format=jpg&name=orig', altText=None)]
5th row[Photo(previewUrl='https://pbs.twimg.com/media/EObTYn9WoAEviCp?format=jpg&name=small', fullUrl='https://pbs.twimg.com/media/EObTYn9WoAEviCp?format=jpg&name=orig', altText=None)]

Common Values

ValueCountFrequency (%)
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1291002994100764674/img/i8TuA2AeaYPTCYFY.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/vid/480x270/dKbXcyMtkCN7GZml.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/pl/eqnXROeHFTWNXoIu.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/vid/640x360/tZaWUqn6FFiCgi6q.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1291002994100764674/vid/1280x720/7MrfPW9eyM0n7xTW.mp4?tag=13', contentType='video/mp4', bitrate=2176000)], duration=30.072, views=3718, altText=None)] 7
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1327008380356091905/img/97aoeLoFlW1b4aYx.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/vid/720x720/traeLGSvu9PukFMo.mp4?tag=13', contentType='video/mp4', bitrate=1280000), VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/vid/320x320/bvzIb7_YQo6-EZME.mp4?tag=13', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/vid/480x480/_dnlxugOwYx3NXSQ.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1327008380356091905/pl/JgoWPW8L8zRPYnvn.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=567.4, views=64389, altText=None)] 6
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/media/EVA7uHnXkAUOEnv.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/480x480/r1EZCycTVfSAjGEf.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/320x320/9C-_LNOZ-mN0Vlfh.mp4?tag=13', contentType='video/mp4', bitrate=432000), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/pl/DxcPDK7x7Zvei8Mi.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/amplify_video/1247562475089924099/vid/720x720/d1L-2RQmpxxlIqCx.mp4?tag=13', contentType='video/mp4', bitrate=1280000)], duration=74.408, views=40408, altText=None)] 6
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1235997093333958657/pu/img/0BpzML29aKZkdWXi.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/640x360/nf_7qMpPmj5nqjHw.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/1280x720/UQ6Mm8r4hHpWEYcm.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/pl/wWezRq2ZFfIvL_vJ.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1235997093333958657/pu/vid/480x270/pvByeCU5JfgklEDx.mp4?tag=10', contentType='video/mp4', bitrate=256000)], duration=3.462, views=15545, altText=None)] 6
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/amplify_video_thumb/1294381698734596101/img/PrEMwcC427fnXn7Z.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1294381698734596101/vid/506x282/RGtuEW9iTI8oH9hf.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1294381698734596101/pl/wvfTknqHtjj9pNUP.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=6.649, views=1099, altText=None)] 5
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1316423494209466368/pu/img/Y-Ejy5wGE2rmCtgM.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1316423494209466368/pu/pl/AKjeh5OYeCsMF5tK.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1316423494209466368/pu/vid/640x360/sw2mZQ3pqt98sOr9.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1316423494209466368/pu/vid/1280x720/-WfAeH_Jn21Z3Fn3.mp4?tag=10', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1316423494209466368/pu/vid/480x270/r6vAccQu2eS6J2n-.mp4?tag=10', contentType='video/mp4', bitrate=256000)], duration=31.398, views=5087, altText=None)] 4
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/media/EcSYCcAXgAA4-zx.jpg', variants=[VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/vid/480x270/5T2ZZLMIomkbwH7R.mp4?tag=13', contentType='video/mp4', bitrate=288000), VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/vid/1280x720/nkrX8VfUjjUGynrI.mp4?tag=13', contentType='video/mp4', bitrate=2176000), VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/vid/640x360/bKT4UiEXRlTD4kz4.mp4?tag=13', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/amplify_video/1280314299454558214/pl/MsZ3_olToB6_HhQ8.m3u8?tag=13', contentType='application/x-mpegURL', bitrate=None)], duration=57.731, views=135655, altText=None)] 4
 
< 0.1%
[Video(thumbnailUrl='https://pbs.twimg.com/ext_tw_video_thumb/1305680649072922627/pu/img/1ntGZ5iB0EWv35ZS.jpg', variants=[VideoVariant(url='https://video.twimg.com/ext_tw_video/1305680649072922627/pu/vid/480x270/XjMvnu8fBdaNznZ1.mp4?tag=10', contentType='video/mp4', bitrate=256000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1305680649072922627/pu/pl/FXRfXz_rNlzw1hOh.m3u8?tag=10', contentType='application/x-mpegURL', bitrate=None), VideoVariant(url='https://video.twimg.com/ext_tw_video/1305680649072922627/pu/vid/640x360/A0V5G_YZSu9DiKbd.mp4?tag=10', contentType='video/mp4', bitrate=832000), VideoVariant(url='https://video.twimg.com/ext_tw_video/1305680649072922627/pu/vid/1056x594/aKUzoyheJs9p3JuO.mp4?tag=10', contentType='video/mp4', bitrate=2176000)], duration=138.727, views=7667, altText=None)] 4
 
< 0.1%
[Gif(thumbnailUrl='https://pbs.twimg.com/tweet_video_thumb/ESxrAy2XkAEbhvB.jpg', variants=[VideoVariant(url='https://video.twimg.com/tweet_video/ESxrAy2XkAEbhvB.mp4', contentType='video/mp4', bitrate=0)], altText=None)] 4
 
< 0.1%
[Photo(previewUrl='https://pbs.twimg.com/media/EXWub6MU8AAqnpO?format=jpg&name=small', fullUrl='https://pbs.twimg.com/media/EXWub6MU8AAqnpO?format=jpg&name=orig', altText=None)] 4
 
< 0.1%
Other values (11158) 11390
 
22.8%
(Missing) 38446
77.1%

mentionedUsers
Categorical

HIGH CARDINALITY  MISSING 

Distinct6517
Distinct (%)43.2%
Missing34786
Missing (%)69.7%
Memory size389.9 KiB
[User(username='POTUS', id=1349149096909668363, displayname='President Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
612
[User(username='realDonaldTrump', id=25073877, displayname='Donald J. Trump', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
487
[User(username='CDCgov', id=146569971, displayname='CDC', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
458
[User(username='JoeBiden', id=939091, displayname='Joe Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
234
[User(username='HHSGov', id=44783853, displayname='HHS.gov', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
 
184
Other values (6512)
13125 

Unique

Unique5098 ?
Unique (%)33.8%

Sample

1st row[User(username='Apple', id=380749300, displayname='Apple', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None), User(username='iTunes', id=66515223, displayname='iTunes', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
2nd row[User(username='NIH', id=15134240, displayname='NIH', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None), User(username='theNCI', id=38530021, displayname='National Cancer Institute', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None), User(username='CDCgov', id=146569971, displayname='CDC', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
3rd row[User(username='Rotary', id=4432431, displayname='Rotary International', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None), User(username='CDCgov', id=146569971, displayname='CDC', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None), User(username='UNICEF', id=33933259, displayname='UNICEF', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
4th row[User(username='janeosanders', id=701081673681637376, displayname="Jane O'Meara Sanders", rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]
5th row[User(username='tperkins', id=18958999, displayname='Tony Perkins', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None), User(username='FRCdc', id=18163042, displayname='Family Research Council', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)]

Common Values

ValueCountFrequency (%)
[User(username='POTUS', id=1349149096909668363, displayname='President Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 612
 
1.2%
[User(username='realDonaldTrump', id=25073877, displayname='Donald J. Trump', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 487
 
1.0%
[User(username='CDCgov', id=146569971, displayname='CDC', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 458
 
0.9%
[User(username='JoeBiden', id=939091, displayname='Joe Biden', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 234
 
0.5%
[User(username='HHSGov', id=44783853, displayname='HHS.gov', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 184
 
0.4%
[User(username='USDA', id=61853389, displayname='Dept. of Agriculture', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 170
 
0.3%
[User(username='SBAgov', id=153149305, displayname='SBA', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 156
 
0.3%
[User(username='WHO', id=14499829, displayname='World Health Organization (WHO)', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 128
 
0.3%
[User(username='SenateDems', id=73238146, displayname='Senate Democrats', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 118
 
0.2%
[User(username='WSJ', id=3108351, displayname='The Wall Street Journal', rawDescription=None, renderedDescription=None, descriptionLinks=None, verified=None, created=None, followersCount=None, friendsCount=None, statusesCount=None, favouritesCount=None, listedCount=None, mediaCount=None, location=None, protected=None, link=None, profileImageUrl=None, profileBannerUrl=None, label=None, blue=None, blueType=None)] 96
 
0.2%
Other values (6507) 12457
 
25.0%
(Missing) 34786
69.7%

links
Categorical

HIGH CARDINALITY  MISSING 

Distinct19761
Distinct (%)98.9%
Missing29901
Missing (%)59.9%
Memory size389.9 KiB
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(125, 148))]
 
15
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(119, 142))]
 
14
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(185, 208))]
 
13
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(182, 205))]
 
11
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(118, 141))]
 
10
Other values (19756)
19922 

Unique

Unique19645 ?
Unique (%)98.3%

Sample

1st row[TextLink(text='cnn.com', url='https://www.cnn.com/', tcourl='https://t.co/Awn955wZ8v', indices=(168, 191))]
2nd row[TextLink(text='health.state.mn.us/people/immuniz…', url='https://www.health.state.mn.us/people/immunize/basics/vaxfinder.html', tcourl='https://t.co/ktfnSM9BqL', indices=(261, 284))]
3rd row[TextLink(text='nytimes.com/2020/01/05/bus…', url='https://www.nytimes.com/2020/01/05/business/boeing-737-max.html?smid=nytcore-ios-share', tcourl='https://t.co/wfhngnB0g1', indices=(177, 200))]
4th row[TextLink(text='bit.ly/2tDaqye', url='https://bit.ly/2tDaqye', tcourl='https://t.co/hSCR6cWVke', indices=(257, 280))]
5th row[TextLink(text='theverge.com/2020/1/9/21058…', url='https://www.theverge.com/2020/1/9/21058562/eraser-button-childrens-data-coppa-walberg-rush-hawley-markey-congress-house', tcourl='https://t.co/3IsWjwfH0J', indices=(207, 230))]

Common Values

ValueCountFrequency (%)
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(125, 148))] 15
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(119, 142))] 14
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(185, 208))] 13
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(182, 205))] 11
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(118, 141))] 10
 
< 0.1%
[TextLink(text='rubio.senate.gov/public/_cache/…', url='https://www.rubio.senate.gov/public/_cache/files/20e74c90-ac60-4227-a7fd-a50c5a429481/D10C63C7653FABA5F5F5EC74FF4E2B21.usc-abroad-help.pdf', tcourl='https://t.co/eobqWCTM56', indices=(228, 251))] 6
 
< 0.1%
[TextLink(text='floridahealthcovid19.gov/covid-19-vacci…', url='https://floridahealthcovid19.gov/covid-19-vaccines-in-florida/', tcourl='https://t.co/h4jdZoPS1z', indices=(177, 200))] 6
 
< 0.1%
[TextLink(text='rubio.senate.gov/public/_cache/…', url='https://www.rubio.senate.gov/public/_cache/files/12825875-93db-4e70-9a2b-21399af3166e/23E439BD3344D4AAAED05F6445188AEA.support.1585005601771.spanish-one-pager-covid-19-pac-3.23-.pdf', tcourl='https://t.co/AjeFKFmQQL', indices=(241, 264))] 5
 
< 0.1%
[TextLink(text='access.live/Tillis', url='https://access.live/Tillis', tcourl='https://t.co/EjMmqNjrwu', indices=(85, 108))] 5
 
< 0.1%
[TextLink(text='manchin.senate.gov/contact-joe/e-…', url='https://www.manchin.senate.gov/contact-joe/e-newsletter-signup', tcourl='https://t.co/cbYJ0VvRAX', indices=(187, 210))] 5
 
< 0.1%
Other values (19751) 19895
39.9%
(Missing) 29901
59.9%

place
Categorical

Distinct16
Distinct (%)48.5%
Missing49853
Missing (%)99.9%
Memory size389.9 KiB
Place(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US')
12 
Place(id='319ee7b36c9149da', fullName='Arlington, VA', name='Arlington', type='city', country='United States', countryCode='US')
Place(id='23de0eea95a2714f', fullName='Red Oak, IA', name='Red Oak', type='city', country='United States', countryCode='US')
Place(id='1c67f9d9cbae7f69', fullName='Des Moines, IA', name='Des Moines', type='city', country='United States', countryCode='US')
Place(id='e7bae49f1ac7f22e', fullName='Salina, KS', name='Salina', type='city', country='United States', countryCode='US')
 
1
Other values (11)
11 

Unique

Unique12 ?
Unique (%)36.4%

Sample

1st rowPlace(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US')
2nd rowPlace(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US')
3rd rowPlace(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US')
4th rowPlace(id='319ee7b36c9149da', fullName='Arlington, VA', name='Arlington', type='city', country='United States', countryCode='US')
5th rowPlace(id='319ee7b36c9149da', fullName='Arlington, VA', name='Arlington', type='city', country='United States', countryCode='US')

Common Values

ValueCountFrequency (%)
Place(id='01fbe706f872cb32', fullName='Washington, DC', name='Washington', type='city', country='United States', countryCode='US') 12
 
< 0.1%
Place(id='319ee7b36c9149da', fullName='Arlington, VA', name='Arlington', type='city', country='United States', countryCode='US') 5
 
< 0.1%
Place(id='23de0eea95a2714f', fullName='Red Oak, IA', name='Red Oak', type='city', country='United States', countryCode='US') 2
 
< 0.1%
Place(id='1c67f9d9cbae7f69', fullName='Des Moines, IA', name='Des Moines', type='city', country='United States', countryCode='US') 2
 
< 0.1%
Place(id='e7bae49f1ac7f22e', fullName='Salina, KS', name='Salina', type='city', country='United States', countryCode='US') 1
 
< 0.1%
Place(id='00e9226863a6e5a4', fullName='Savannah, GA', name='Savannah', type='city', country='United States', countryCode='US') 1
 
< 0.1%
Place(id='44d207663001f00b', fullName='Mesa, AZ', name='Mesa', type='city', country='United States', countryCode='US') 1
 
< 0.1%
Place(id='2ca1e1d1d0fae614', fullName='Dover, DE', name='Dover', type='city', country='United States', countryCode='US') 1
 
< 0.1%
Place(id='2d83c71ce16cd187', fullName='West Virginia, USA', name='West Virginia', type='admin', country='United States', countryCode='US') 1
 
< 0.1%
Place(id='0ca7e086f2c3e60f', fullName='Onawa, IA', name='Onawa', type='city', country='United States', countryCode='US') 1
 
< 0.1%
Other values (6) 6
 
< 0.1%
(Missing) 49853
99.9%

quotedTweet
Categorical

HIGH CARDINALITY  MISSING 

Distinct6010
Distinct (%)92.2%
Missing43370
Missing (%)86.9%
Memory size389.9 KiB
https://twitter.com/US_FDA/status/1429800729917669379
 
13
https://twitter.com/nytimes/status/1282795107478056963
 
9
https://twitter.com/realDonaldTrump/status/1266041589455077381
 
8
https://twitter.com/CNBCnow/status/1275823820448976897
 
8
https://twitter.com/realDonaldTrump/status/1311892190680014849
 
7
Other values (6005)
6471 

Unique

Unique5653 ?
Unique (%)86.8%

Sample

1st rowhttps://twitter.com/realDonaldTrump/status/1215287261606051844
2nd rowhttps://twitter.com/mitchellreports/status/1215689111681536005
3rd rowhttps://twitter.com/USATODAY/status/1215704357636722688
4th rowhttps://twitter.com/Haleaziz/status/1215739606613680129
5th rowhttps://twitter.com/HuffPost/status/1215800380975714305

Common Values

ValueCountFrequency (%)
https://twitter.com/US_FDA/status/1429800729917669379 13
 
< 0.1%
https://twitter.com/nytimes/status/1282795107478056963 9
 
< 0.1%
https://twitter.com/realDonaldTrump/status/1266041589455077381 8
 
< 0.1%
https://twitter.com/CNBCnow/status/1275823820448976897 8
 
< 0.1%
https://twitter.com/realDonaldTrump/status/1311892190680014849 7
 
< 0.1%
https://twitter.com/politico/status/1245099442832912387 7
 
< 0.1%
https://twitter.com/CDCgov/status/1392911350058323973 6
 
< 0.1%
https://twitter.com/SBAgov/status/1389207228473872385 6
 
< 0.1%
https://twitter.com/SenCoryGardner/status/1263138698708889606 6
 
< 0.1%
https://twitter.com/realDonaldTrump/status/1313551795646541824 5
 
< 0.1%
Other values (6000) 6441
 
12.9%
(Missing) 43370
86.9%

url
Categorical

Distinct49841
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
https://twitter.com/CoryGardner/status/1257455023753232392
 
4
https://twitter.com/CoryGardner/status/1247272748994506757
 
4
https://twitter.com/CoryGardner/status/1313603731926130688
 
4
https://twitter.com/CoryGardner/status/1300796593319432192
 
4
https://twitter.com/CoryGardner/status/1258103471531835397
 
4
Other values (49836)
49866 

Unique

Unique49826 ?
Unique (%)99.9%

Sample

1st rowhttps://twitter.com/ThomTillis/status/1212787945423491073
2nd rowhttps://twitter.com/amyklobuchar/status/1212829387713855490
3rd rowhttps://twitter.com/SenAmyKlobuchar/status/1213142726046298112
4th rowhttps://twitter.com/SenBlumenthal/status/1213976618240290816
5th rowhttps://twitter.com/SenJackyRosen/status/1215046483629789184

Common Values

ValueCountFrequency (%)
https://twitter.com/CoryGardner/status/1257455023753232392 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1247272748994506757 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1313603731926130688 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1300796593319432192 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1258103471531835397 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1312117497705889792 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1261456165000540161 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1305680725367193600 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1265344170811482113 4
 
< 0.1%
https://twitter.com/CoryGardner/status/1247983372250624003 4
 
< 0.1%
Other values (49831) 49846
99.9%

date
Categorical

Distinct48359
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
2020-12-22 15:07:47+00:00
 
4
2021-03-06 17:29:07+00:00
 
4
2020-10-09 23:23:39+00:00
 
4
2020-10-09 23:41:19+00:00
 
4
2020-12-08 16:28:44+00:00
 
4
Other values (48354)
49866 

Unique

Unique47017 ?
Unique (%)94.2%

Sample

1st row2020-01-02 17:29:13+00:00
2nd row2020-01-02 20:13:53+00:00
3rd row2020-01-03 16:58:59+00:00
4th row2020-01-06 00:12:35+00:00
5th row2020-01-08 23:03:50+00:00

Common Values

ValueCountFrequency (%)
2020-12-22 15:07:47+00:00 4
 
< 0.1%
2021-03-06 17:29:07+00:00 4
 
< 0.1%
2020-10-09 23:23:39+00:00 4
 
< 0.1%
2020-10-09 23:41:19+00:00 4
 
< 0.1%
2020-12-08 16:28:44+00:00 4
 
< 0.1%
2020-09-09 14:16:59+00:00 4
 
< 0.1%
2020-09-09 14:17:00+00:00 4
 
< 0.1%
2020-04-06 21:19:31+00:00 4
 
< 0.1%
2021-05-06 15:04:19+00:00 4
 
< 0.1%
2020-05-18 16:26:21+00:00 4
 
< 0.1%
Other values (48349) 49846
99.9%

replyCount
Categorical

Distinct2208
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
1
 
2190
2
 
2111
3
 
2049
4
 
1916
5
 
1777
Other values (2203)
39843 

Unique

Unique1066 ?
Unique (%)2.1%

Sample

1st row64
2nd row70
3rd row7
4th row21
5th row6

Common Values

ValueCountFrequency (%)
1 2190
 
4.4%
2 2111
 
4.2%
3 2049
 
4.1%
4 1916
 
3.8%
5 1777
 
3.6%
6 1548
 
3.1%
7 1412
 
2.8%
8 1373
 
2.8%
9 1187
 
2.4%
0 1185
 
2.4%
Other values (2198) 33138
66.4%

retweetCount
Categorical

Distinct3351
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
3
 
2430
4
 
2337
2
 
2228
5
 
2060
6
 
1892
Other values (3346)
38939 

Unique

Unique1731 ?
Unique (%)3.5%

Sample

1st row22
2nd row613
3rd row18
4th row49
5th row9

Common Values

ValueCountFrequency (%)
3 2430
 
4.9%
4 2337
 
4.7%
2 2228
 
4.5%
5 2060
 
4.1%
6 1892
 
3.8%
7 1597
 
3.2%
1 1590
 
3.2%
8 1490
 
3.0%
9 1319
 
2.6%
10 1102
 
2.2%
Other values (3341) 31841
63.8%

likeCount
Categorical

Distinct6330
Distinct (%)12.7%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
9
 
740
8
 
695
7
 
679
12
 
658
10
 
650
Other values (6325)
46464 

Unique

Unique3869 ?
Unique (%)7.8%

Sample

1st row131
2nd row2449
3rd row41
4th row137
5th row48

Common Values

ValueCountFrequency (%)
9 740
 
1.5%
8 695
 
1.4%
7 679
 
1.4%
12 658
 
1.3%
10 650
 
1.3%
11 643
 
1.3%
13 622
 
1.2%
6 621
 
1.2%
14 604
 
1.2%
17 592
 
1.2%
Other values (6320) 43382
87.0%

quoteCount
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct1042
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
0
12162 
1
8334 
2
5191 
3
3280 
4
2388 
Other values (1037)
18531 

Unique

Unique518 ?
Unique (%)1.0%

Sample

1st row12
2nd row22
3rd row1
4th row3
5th row2

Common Values

ValueCountFrequency (%)
0 12162
24.4%
1 8334
16.7%
2 5191
10.4%
3 3280
 
6.6%
4 2388
 
4.8%
5 1710
 
3.4%
6 1460
 
2.9%
7 1070
 
2.1%
8 959
 
1.9%
9 706
 
1.4%
Other values (1032) 12626
25.3%

conversationId
Categorical

Distinct46570
Distinct (%)93.4%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
1243022844767940610
 
16
1303693449074479104
 
14
1262914459774918658
 
13
1234575422039433221
 
12
1240424113841360896
 
12
Other values (46565)
49819 

Unique

Unique44220 ?
Unique (%)88.6%

Sample

1st row1212787945423491073
2nd row1212829387713855490
3rd row1213142726046298112
4th row1213976618240290816
5th row1215046483629789184

Common Values

ValueCountFrequency (%)
1243022844767940610 16
 
< 0.1%
1303693449074479104 14
 
< 0.1%
1262914459774918658 13
 
< 0.1%
1234575422039433221 12
 
< 0.1%
1240424113841360896 12
 
< 0.1%
1288914034939527168 11
 
< 0.1%
1275427516241588225 11
 
< 0.1%
1232703071609860099 10
 
< 0.1%
1245021944312979456 10
 
< 0.1%
1599785697241464838 9
 
< 0.1%
Other values (46560) 49768
99.8%

contains_keyword
Categorical

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
True
49886 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTrue
2nd rowTrue
3rd rowTrue
4th rowTrue
5th rowTrue

Common Values

ValueCountFrequency (%)
True 49886
100.0%

Common Values (Plot)

2023-08-08T14:58:41.796432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

keywords
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct1734
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
covid
9396 
pandemic
9103 
covid-19
6107 
corona
4789 
vaccine
1999 
Other values (1729)
18492 

Unique

Unique1059 ?
Unique (%)2.1%

Sample

1st rowcdc
2nd rown95
3rd rowvaccine
4th rowwfh
5th rowepidemic

Common Values

ValueCountFrequency (%)
covid 9396
18.8%
pandemic 9103
18.2%
covid-19 6107
12.2%
corona 4789
 
9.6%
vaccine 1999
 
4.0%
covid,pandemic 1413
 
2.8%
covid-19,pandemic 1294
 
2.6%
corona,pandemic 1172
 
2.3%
covid,vaccine 818
 
1.6%
covid-19,vaccine 804
 
1.6%
Other values (1724) 12991
26.0%

level_0
Categorical

Distinct109
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
73.0
 
1864
37.0
 
1479
91.0
 
1405
105.0
 
1218
29.0
 
1157
Other values (104)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row99.0
2nd row58.0
3rd row58.0
4th row14.0
5th row82.0

Common Values

ValueCountFrequency (%)
73.0 1864
 
3.7%
37.0 1479
 
3.0%
91.0 1405
 
2.8%
105.0 1218
 
2.4%
29.0 1157
 
2.3%
3.0 1110
 
2.2%
34.0 1108
 
2.2%
93.0 1091
 
2.2%
84.0 1021
 
2.0%
82.0 976
 
2.0%
Other values (99) 37457
75.1%

index_y
Categorical

Distinct109
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
73.0
 
1864
37.0
 
1479
91.0
 
1405
105.0
 
1218
29.0
 
1157
Other values (104)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row99.0
2nd row58.0
3rd row58.0
4th row14.0
5th row82.0

Common Values

ValueCountFrequency (%)
73.0 1864
 
3.7%
37.0 1479
 
3.0%
91.0 1405
 
2.8%
105.0 1218
 
2.4%
29.0 1157
 
2.3%
3.0 1110
 
2.2%
34.0 1108
 
2.2%
93.0 1091
 
2.2%
84.0 1021
 
2.0%
82.0 976
 
2.0%
Other values (99) 37457
75.1%

name
Categorical

Distinct109
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
Murray, Patty
 
1864
Durbin, Richard J.
 
1479
Shaheen, Jeanne
 
1405
Warren, Elizabeth
 
1218
Cornyn, John
 
1157
Other values (104)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTillis, Thom
2nd rowKlobuchar, Amy
3rd rowKlobuchar, Amy
4th rowBlumenthal, Richard
5th rowRosen, Jacky

Common Values

ValueCountFrequency (%)
Murray, Patty 1864
 
3.7%
Durbin, Richard J. 1479
 
3.0%
Shaheen, Jeanne 1405
 
2.8%
Warren, Elizabeth 1218
 
2.4%
Cornyn, John 1157
 
2.3%
Harris, Kamala 1110
 
2.2%
Cruz, Ted 1108
 
2.2%
Sinema, Kyrsten 1091
 
2.2%
Rubio, Marco 1021
 
2.0%
Rosen, Jacky 976
 
2.0%
Other values (99) 37457
75.1%

id.1
Categorical

Distinct109
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
74.0
 
1864
38.0
 
1479
92.0
 
1405
106.0
 
1218
30.0
 
1157
Other values (104)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row100.0
2nd row59.0
3rd row59.0
4th row15.0
5th row83.0

Common Values

ValueCountFrequency (%)
74.0 1864
 
3.7%
38.0 1479
 
3.0%
92.0 1405
 
2.8%
106.0 1218
 
2.4%
30.0 1157
 
2.3%
4.0 1110
 
2.2%
35.0 1108
 
2.2%
94.0 1091
 
2.2%
85.0 1021
 
2.0%
83.0 976
 
2.0%
Other values (99) 37457
75.1%

state_short
Categorical

Distinct50
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
CA
 
2402
IL
 
2289
TX
 
2265
NH
 
2208
MA
 
2154
Other values (45)
38568 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNC
2nd rowMN
3rd rowMN
4th rowCT
5th rowNV

Common Values

ValueCountFrequency (%)
CA 2402
 
4.8%
IL 2289
 
4.6%
TX 2265
 
4.5%
NH 2208
 
4.4%
MA 2154
 
4.3%
WA 2076
 
4.2%
FL 1781
 
3.6%
AZ 1766
 
3.5%
NV 1755
 
3.5%
TN 1500
 
3.0%
Other values (40) 29690
59.5%

party
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
1.0
28204 
0.0
19612 
2.0
 
2070

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row1.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.0 28204
56.5%
0.0 19612
39.3%
2.0 2070
 
4.1%

Common Values (Plot)

2023-08-08T14:58:41.865336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

class
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
2.0
17187 
3.0
16575 
1.0
16124 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row1.0
4th row3.0
5th row1.0

Common Values

ValueCountFrequency (%)
2.0 17187
34.5%
3.0 16575
33.2%
1.0 16124
32.3%

Common Values (Plot)

2023-08-08T14:58:41.940459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

ideology
Categorical

Distinct108
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
0.142703588817088
 
1864
0.0855733771029607
 
1479
0.2925665319541
 
1405
0.0583875007437665
 
1218
0.772226738391321
 
1157
Other values (103)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.819146177750934
2nd row0.130504324943533
3rd row0.130504324943533
4th row0.0310655954121906
5th row0.308548351377894

Common Values

ValueCountFrequency (%)
0.142703588817088 1864
 
3.7%
0.0855733771029607 1479
 
3.0%
0.2925665319541 1405
 
2.8%
0.0583875007437665 1218
 
2.4%
0.772226738391321 1157
 
2.3%
0.0213759569468058 1110
 
2.2%
0.944056385174951 1108
 
2.2%
0.500967034663567 1091
 
2.2%
0.831181764071725 1021
 
2.0%
0.308548351377894 976
 
2.0%
Other values (98) 37457
75.1%

start_serving
Categorical

Distinct42
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
01/03/2013
4740 
01/06/2015
4244 
01/05/2011
4008 
01/03/2017
3941 
01/03/2019
3795 
Other values (37)
29158 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row01/06/2015
2nd row01/04/2007
3rd row01/04/2007
4th row01/03/2010
5th row01/03/2019

Common Values

ValueCountFrequency (%)
01/03/2013 4740
 
9.5%
01/06/2015 4244
 
8.5%
01/05/2011 4008
 
8.0%
01/03/2017 3941
 
7.9%
01/03/2019 3795
 
7.6%
01/04/2007 3261
 
6.5%
01/06/2009 2634
 
5.3%
01/03/2021 2221
 
4.5%
01/07/1997 2155
 
4.3%
01/05/1993 1864
 
3.7%
Other values (32) 17023
34.1%

end_serving
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
12/31/2022
47062 
01/18/2021
 
1110
01/03/2021
 
1007
01/20/2021
 
450
01/03/2019
 
257

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row12/31/2022
2nd row12/31/2022
3rd row12/31/2022
4th row12/31/2022
5th row12/31/2022

Common Values

ValueCountFrequency (%)
12/31/2022 47062
94.3%
01/18/2021 1110
 
2.2%
01/03/2021 1007
 
2.0%
01/20/2021 450
 
0.9%
01/03/2019 257
 
0.5%

Common Values (Plot)

2023-08-08T14:58:42.017255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

time_in_office
Categorical

Distinct46
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
9.9972602739726
4740 
11.9945205479452
4008 
7.98904109589041
3959 
3.99452054794521
3795 
16.0
3261 
Other values (41)
30123 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row7.98904109589041
2nd row16.0
3rd row16.0
4th row13.0
5th row3.99452054794521

Common Values

ValueCountFrequency (%)
9.9972602739726 4740
 
9.5%
11.9945205479452 4008
 
8.0%
7.98904109589041 3959
 
7.9%
3.99452054794521 3795
 
7.6%
16.0 3261
 
6.5%
5.99452054794521 2831
 
5.7%
13.9917808219178 2317
 
4.6%
1.99178082191781 2221
 
4.5%
25.9972602739726 2075
 
4.2%
30.0054794520548 1864
 
3.7%
Other values (36) 18815
37.7%

not_in_office
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
0.0
44451 
1.0
5435 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row0.0
4th row0.0
5th row0.0

Common Values

ValueCountFrequency (%)
0.0 44451
89.1%
1.0 5435
 
10.9%

Common Values (Plot)

2023-08-08T14:58:42.315080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

last_congress
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
117.0
47062 
116.0
 
2824

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row117.0
2nd row117.0
3rd row117.0
4th row117.0
5th row117.0

Common Values

ValueCountFrequency (%)
117.0 47062
94.3%
116.0 2824
 
5.7%

Common Values (Plot)

2023-08-08T14:58:42.381014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

vote_share
Categorical

HIGH CARDINALITY  MISSING 

Distinct96
Distinct (%)0.2%
Missing1460
Missing (%)2.9%
Memory size389.9 KiB
59.1
 
1864
60.4
 
1510
53.5
 
1497
54.9
 
1479
50.9
 
1411
Other values (91)
40665 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48.7
2nd row60.3
3rd row60.3
4th row62.9
5th row50.4

Common Values

ValueCountFrequency (%)
59.1 1864
 
3.7%
60.4 1510
 
3.0%
53.5 1497
 
3.0%
54.9 1479
 
3.0%
50.9 1411
 
2.8%
56.6 1405
 
2.8%
62.4 1316
 
2.6%
50.0 1091
 
2.2%
52.0 1021
 
2.0%
50.4 976
 
2.0%
Other values (86) 34856
69.9%
(Missing) 1460
 
2.9%

next_closest_share
Categorical

HIGH CARDINALITY  MISSING 

Distinct89
Distinct (%)0.2%
Missing1460
Missing (%)2.9%
Memory size389.9 KiB
40.9
 
2302
36.2
 
2211
43.9
 
1876
33.0
 
1570
38.9
 
1479
Other values (84)
38988 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46.9
2nd row36.2
3rd row36.2
4th row34.9
5th row45.4

Common Values

ValueCountFrequency (%)
40.9 2302
 
4.6%
36.2 2211
 
4.4%
43.9 1876
 
3.8%
33.0 1570
 
3.1%
38.9 1479
 
3.0%
45.4 1478
 
3.0%
41.0 1405
 
2.8%
37.6 1110
 
2.2%
48.3 1108
 
2.2%
47.6 1091
 
2.2%
Other values (79) 32796
65.7%
(Missing) 1460
 
2.9%

election_year
Categorical

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
2020
16369 
2018
16124 
2016
14926 
*
 
1460
2014
 
946

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2018
3rd row2018
4th row2016
5th row2018

Common Values

ValueCountFrequency (%)
2020 16369
32.8%
2018 16124
32.3%
2016 14926
29.9%
* 1460
 
2.9%
2014 946
 
1.9%
2017 61
 
0.1%

Common Values (Plot)

2023-08-08T14:58:42.456383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

alt_handle
Categorical

HIGH CARDINALITY  MISSING 

Distinct68
Distinct (%)0.3%
Missing22882
Missing (%)45.9%
Memory size389.9 KiB
murraycampaign
 
1503
JeanneShaheen
 
1314
DickDurbin
 
1111
RosenforNevada
 
959
ewarren
 
822
Other values (63)
21295 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowamyklobuchar
2nd rowRosenforNevada
3rd rowedmarkey
4th rowSenBrianSchatz
5th rowBernieSanders

Common Values

ValueCountFrequency (%)
murraycampaign 1503
 
3.0%
JeanneShaheen 1314
 
2.6%
DickDurbin 1111
 
2.2%
RosenforNevada 959
 
1.9%
ewarren 822
 
1.6%
chuckschumer 782
 
1.6%
CortezMasto 756
 
1.5%
kyrstensinema 754
 
1.5%
Maggie_Hassan 747
 
1.5%
scottforflorida 743
 
1.5%
Other values (58) 17513
35.1%
(Missing) 22882
45.9%

date_of_birth
Categorical

Distinct109
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
10/11/1950
 
1864
11/21/1944
 
1479
01/28/1947
 
1405
06/22/1949
 
1218
02/02/1952
 
1157
Other values (104)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row08/30/1960
2nd row05/25/1960
3rd row05/25/1960
4th row02/13/1946
5th row08/02/1957

Common Values

ValueCountFrequency (%)
10/11/1950 1864
 
3.7%
11/21/1944 1479
 
3.0%
01/28/1947 1405
 
2.8%
06/22/1949 1218
 
2.4%
02/02/1952 1157
 
2.3%
10/20/1964 1110
 
2.2%
12/22/1970 1108
 
2.2%
07/12/1976 1091
 
2.2%
05/28/1971 1021
 
2.0%
08/02/1957 976
 
2.0%
Other values (99) 37457
75.1%

female
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
0.0
32102 
1.0
17784 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row1.0
3rd row1.0
4th row0.0
5th row1.0

Common Values

ValueCountFrequency (%)
0.0 32102
64.4%
1.0 17784
35.6%

Common Values (Plot)

2023-08-08T14:58:42.536215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

ethnicity
Categorical

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
White
41469 
Hispanic
 
3194
Hispanic; White
 
1887
African-American; Asian-American
 
1548
Asian; White
 
810
Other values (2)
 
978

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowWhite
2nd rowWhite
3rd rowWhite
4th rowWhite
5th rowWhite

Common Values

ValueCountFrequency (%)
White 41469
83.1%
Hispanic 3194
 
6.4%
Hispanic; White 1887
 
3.8%
African-American; Asian-American 1548
 
3.1%
Asian; White 810
 
1.6%
African-American 755
 
1.5%
Asian 223
 
0.4%

Common Values (Plot)

2023-08-08T14:58:42.612673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

edu_level
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
8.0
28384 
6.0
10969 
7.0
10533 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6.0
2nd row8.0
3rd row8.0
4th row8.0
5th row6.0

Common Values

ValueCountFrequency (%)
8.0 28384
56.9%
6.0 10969
 
22.0%
7.0 10533
 
21.1%

Common Values (Plot)

2023-08-08T14:58:42.696154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

edu_information
Categorical

Distinct109
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
B.A.; Physical Education; Washington State University; 1972
 
1864
J.D.; Georgetown University; 1969
 
1479
M.S.S.; University of Mississippi; 1973
 
1405
J.D.; Rutgers University; 1976
 
1218
J.D.; St. Mary’s School of Law; 1977
 
1157
Other values (104)
42763 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB.S.; Technology Management; University of Maryland; 1996
2nd rowJ.D.; University of Chicago, 1985
3rd rowJ.D.; University of Chicago, 1985
4th rowJ.D.; Yale University; 1973
5th rowB.A.; Psychology; University of Minnesota; 1979

Common Values

ValueCountFrequency (%)
B.A.; Physical Education; Washington State University; 1972 1864
 
3.7%
J.D.; Georgetown University; 1969 1479
 
3.0%
M.S.S.; University of Mississippi; 1973 1405
 
2.8%
J.D.; Rutgers University; 1976 1218
 
2.4%
J.D.; St. Mary’s School of Law; 1977 1157
 
2.3%
J.D.; University of California; 1989 1110
 
2.2%
J.D.; Harvard University; 1995 1108
 
2.2%
PhD in Justice Studies; Arizona State University; 2012 1091
 
2.2%
J.D.; University of Miami; 1996 1021
 
2.0%
B.A.; Psychology; University of Minnesota; 1979 976
 
2.0%
Other values (99) 37457
75.1%

occup_level
Categorical

Distinct13
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
2.0
22193 
1.0
5375 
5.0
5372 
0.0
5124 
11.0
3837 
Other values (8)
7985 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row2.0
3rd row2.0
4th row2.0
5th row1.0

Common Values

ValueCountFrequency (%)
2.0 22193
44.5%
1.0 5375
 
10.8%
5.0 5372
 
10.8%
0.0 5124
 
10.3%
11.0 3837
 
7.7%
3.0 2791
 
5.6%
6.0 2096
 
4.2%
12.0 796
 
1.6%
9.0 753
 
1.5%
8.0 478
 
1.0%
Other values (3) 1071
 
2.1%

alt
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
False
39080 
True
10806 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTrue
2nd rowTrue
3rd rowFalse
4th rowFalse
5th rowFalse

Common Values

ValueCountFrequency (%)
False 39080
78.3%
True 10806
 
21.7%

Common Values (Plot)

2023-08-08T14:58:42.767250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

twitter_handle
Categorical

HIGH CARDINALITY  MISSING 

Distinct62
Distinct (%)0.6%
Missing39080
Missing (%)78.3%
Memory size389.9 KiB
SenAmyKlobuchar
 
576
SenatorMarshall
 
561
SenAlexPadilla
 
553
VP
 
450
SenMarkey
 
413
Other values (57)
8253 

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowSenThomTillis
2nd rowSenAmyKlobuchar
3rd rowSenAlexPadilla
4th rowSenMarkey
5th rowSenWarren

Common Values

ValueCountFrequency (%)
SenAmyKlobuchar 576
 
1.2%
SenatorMarshall 561
 
1.1%
SenAlexPadilla 553
 
1.1%
VP 450
 
0.9%
SenMarkey 413
 
0.8%
SenTedCruz 408
 
0.8%
SenWarren 396
 
0.8%
SenSanders 381
 
0.8%
senrandpaul 373
 
0.7%
senmarcorubio 370
 
0.7%
Other values (52) 6325
 
12.7%
(Missing) 39080
78.3%

tweetLen
Categorical

Distinct318
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size389.9 KiB
280
 
1559
279
 
1271
278
 
1067
277
 
971
284
 
917
Other values (313)
44101 

Unique

Unique13 ?
Unique (%)< 0.1%

Sample

1st row134
2nd row191
3rd row308
4th row200
5th row291

Common Values

ValueCountFrequency (%)
280 1559
 
3.1%
279 1271
 
2.5%
278 1067
 
2.1%
277 971
 
1.9%
284 917
 
1.8%
276 855
 
1.7%
275 792
 
1.6%
283 758
 
1.5%
274 689
 
1.4%
288 627
 
1.3%
Other values (308) 40380
80.9%
\ No newline at end of file