Professional Documents
Culture Documents
Aim
How data collected from mobile phones can be used to uncover regular rules and structures in the behavior of both individual and organization
Bluetooth
Bluetooth is short-range RF network
10-30 meters in practice
~450k hours of information about users location, communication, and usage behavior
http://reality.media.mit.edu
User modeling
Easily identifiable routines in every persons life Simple model of behavior
Home, work, elsewhere
User modeling
Accurate location from cell tower
Complicated as a phone can receive signals from far-away towers Accuracy gets better if user spends enough time
Distribution of time spent with a set of towers adds accuracy
Office mates
Observations
Different sets of towers for users within 10 m of radius 6% of time, users were without signal 21% to 29%, users were in range of Bluetooth devices or other mobile phones Could Bluetooth be used for localization inside building during such times?
GPS does not work indoors
There is inherent randomness present among the routines Use of information entropy metric to quantify the predictable amount
User modeling
Role of time is very clear in predicting user behavior Uses HMM and EM to model and trains with 1 month of data 95% accuracy achieved
Not much use of sophisticated features Snake used as much as elaborate media player
Average application usage in three locations (other, work, and home) for 100 subjects.
The x-axis displays the fraction of time each application is used, as a function of total application usage. For example, the usage at home of the clock application comprises almost 3% of the total times the phone is used. The phone application itself comprises more than 80% of the total usage and was not included in this figure
Later increments were done in RAM and final logs were written to the card
Bluetooth errors
Several technical issues in verifying the accuracy of collected data
10m range with ability to penetrate walls Periodical scans miss short proximity event A device may not be discovered (1% to 3%) Application crash (once every three days)
Redundancy could be leveraged
Human-induced errors
Two main errors
Phone being off
Battery exhausted Explicit turn-off
1/5 of users do it regularly classrooms, night, movies. Log is time-stamped before the turn-off
Human-induced errors
Forgetting phone
30% claim of never forgetting it 40% claim once every month 30% claim once every week
Missing data
Major causes
Data corruption Powered-off devices
Surveys
Subjects were also surveyed about their social network For senior students
High correlation
Logged BTID and dyadic self-report/proximity data
Community structure
Human landmarks
Who the user will meet can be guessed
Relationship inference
Nature of association can be inferred
Proximity Frequency
Proximity networks
Different than the organizational structure
Structured around the faculty director
Hub-and-spoke with changing roles Proximity n/w data is extremely dynamic and sparse. Deadlines bring more reliance on support of the group
Exploring dynamics of a group in response to both external and internal stimuli
Proximity networks
Peoples free time and schedules shift dramatically to met deadlines and project goals
Spending much of the night in lab just before the event
Conclusions
First paper to log data at such a magnitude and depth Provides ethnographic studies, individual user modeling, group user modeling