
By S. Subha Surya and R. Nandhini

Definition: The curse of dimensionality refers to various phenomena that arise when analyzing and organizing data in high-dimensional spaces (often with hundreds or thousands of dimensions) and that do not occur in low-dimensional settings such as the physical space commonly modeled with just three dimensions.

The curse of dimensionality is one buzzword for many problems. Data analysis tools based on learning principles infer knowledge, or information, from available learning samples. Obviously, the models built through learning are only valid in the range, or volume, of the space where learning data are available. Whatever the model or class of models, generalization to data that are very different from all learning points is impossible. In other words, reliable generalization is possible through interpolation but not through extrapolation.
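To make the interpolation-versus-extrapolation point concrete, here is a minimal sketch (assuming NumPy; the cubic degree and the sine target are arbitrary illustrative choices, not from the original): a polynomial fitted on samples confined to [0, 1] predicts reasonably inside that interval but fails at x = 2, far from all learning points.

import numpy as np

rng = np.random.default_rng(0)
x = rng.random(50)                               # learning samples confined to [0, 1]
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.1, size=50)

model = np.poly1d(np.polyfit(x, y, deg=3))       # least-squares cubic fit

for t in (0.5, 0.9, 2.0):                        # 2.0 lies outside the learning volume
    print(f"x={t}: model {model(t):+.2f}, true {np.sin(2 * np.pi * t):+.2f}")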

The number of training samples
What would the probability density function look like if the dimensionality is very high? For a 7-dimensional space where each variable can take 20 possible values, the 7-d histogram contains 20^7 (about 1.3 billion) cells. Distributing a training set of any reasonable size (say, 1000 samples) among this many cells leaves virtually all the cells empty.
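A minimal sketch of this counting argument (assuming NumPy; the uniform random data and the grid sizes are illustrative choices): discretize each coordinate into 20 levels and count how many of the 20^d cells a 1000-point sample actually occupies.

import numpy as np

rng = np.random.default_rng(0)
n_samples, levels = 1000, 20

for d in (1, 2, 3, 7):
    points = rng.random((n_samples, d))
    bins = np.floor(points * levels).astype(int)        # cell index per coordinate
    occupied = len({tuple(row) for row in bins})        # number of distinct occupied cells
    total = levels ** d
    print(f"d={d}: {occupied} of {total} cells occupied ({occupied / total:.2e})")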

Accuracy and overfitting
In theory, the higher the dimensionality, the lower the error and the better the performance. However, in realistic pattern recognition (PR) problems, the opposite is often true. Why?
The assumption that the pdf behaves like a Gaussian is only approximately true. When increasing the dimensionality, we may be overfitting the training set. The problem: excellent performance on the training set, but poor performance on new data points that are in fact very close to the data within the training set.
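A minimal sketch of this effect (assuming NumPy and scikit-learn; the sample sizes, synthetic labels, and logistic-regression classifier are illustrative choices, not from the original): labels depend on only two informative features, and pure-noise features are added in growing batches. Train accuracy typically climbs toward 1.0 while test accuracy degrades, even though the underlying problem is unchanged.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n, n_informative = 100, 2

for n_noise in (0, 10, 50, 200):
    X_inf = rng.normal(size=(n, n_informative))
    y = (X_inf.sum(axis=1) > 0).astype(int)      # labels depend only on informative features
    X = np.hstack([X_inf, rng.normal(size=(n, n_noise))])
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, train_size=50, random_state=0)
    clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    print(f"{n_informative + n_noise:4d} features: "
          f"train acc {clf.score(X_tr, y_tr):.2f}, test acc {clf.score(X_te, y_te):.2f}")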

Visualization: projection of high-dimensional data onto 2D or 3D.
Data compression: efficient storage and retrieval.
Noise removal: positive effect on query accuracy.

Applications:
Customer relationship management
Text mining
Image retrieval
Microarray data analysis
Protein classification
Face recognition
Handwritten digit recognition
Intrusion detection

Given x ∈ R^N, the goal is to find a linear transformation matrix U of size N×K such that y = U^T x ∈ R^K, where K << N.
Idea: represent vectors using a set of basis vectors in an appropriate lower-dimensional space.
(1) Higher-dimensional space representation: x = a1 v1 + a2 v2 + ... + aN vN, where v1, ..., vN is a basis of the N-dimensional space.

(2) Lower-dimensional space representation: x ≈ b1 u1 + b2 u2 + ... + bK uK, where u1, ..., uK is a basis of the K-dimensional subspace.
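The text does not name how U is chosen, but PCA is the standard choice: take the columns of U to be the top-K eigenvectors of the sample covariance matrix. A minimal NumPy sketch (the random data here are purely illustrative):

import numpy as np

rng = np.random.default_rng(0)
N, K = 10, 2                               # original and reduced dimensionality
X = rng.normal(size=(500, N))              # rows are samples x in R^N

X_centered = X - X.mean(axis=0)
cov = np.cov(X_centered, rowvar=False)     # N x N sample covariance
eigvals, eigvecs = np.linalg.eigh(cov)     # eigh returns ascending eigenvalues
U = eigvecs[:, ::-1][:, :K]                # N x K matrix of top-K eigenvectors

Y = X_centered @ U                         # y = U^T x applied to every sample
print(U.shape, Y.shape)                    # (10, 2) (500, 2)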

From a theoretical point of view, increasing the number of features should lead to better performance (assuming independent features). In practice, however, the inclusion of more features often leads to worse performance (the curse of dimensionality): the number of training examples needed grows exponentially with the dimensionality.
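A one-line illustration of that exponential growth (assuming, hypothetically, that 10 samples per axis give adequate coverage in one dimension):

m = 10                                     # samples per axis, assumed adequate in 1-D
for d in (1, 2, 3, 7):
    print(f"d={d}: need about {m**d:,} samples for the same density")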
