• About Us
  • News
  • Events
  • Student Affairs
  • Career Development Centre
  • Students@Engineering
  • Academics
    • Programmes
      • Undergraduate Programmes
      • Graduate Programmes
        • Masters Programmes
        • Doctoral Programmes
    • Teaching Laboratories
    • Virtual Laboratories
    • Project Based Learning
  • Admission
    • Undergraduate Admission
    • Graduate Admission
      • Masters Admissions
      • Doctoral Admissions
  • People
  • Research
  • About Us
  • News
  • Events
  • Office of the Dean of Students
  • Career Development Centre
  • Students@Engineering
  • Academics
    Programmes Teaching Laboratories Virtual Laboratories Project Based Learning
  • Admission
    Undergraduate Admission Graduate Admission Doctoral Admission
  • People
  • Research

Cost-Sensitive Big Data Analytics

Sanjay Chaudhary
Ankit Desai
School of Engineering and Applied Science

ABSTRACT

We are developing an algorithm which works in a distributed environment with a goal to reduce the overall misclassification cost. Moreover, this will solve the problem of learning from the highly imbalanced dataset as Cost-Sensitive classification is majorly applied in solving class imbalance problem.

Description

Data mining classification algorithms can be classified into two categories. i.e. error-based model (EBM) and cost-based model (CBM). EBM does not incorporate the cost of misclassification in the model building phase while CBM does. EBM treats all errors equally likely, which is not the case with all real-world applications like credit card fraud detection, medical diagnosis etc. Shopping carts, credit card fraud detection system, loan approval system, medical diagnosis etc. are some example systems, which largely works in spread across the environment. Therefore, to perform classification for such data requires a distributed system. Moreover, in such applications, the volume of the data is very high. CBM in the distributed environment helps in reducing the overall misclassification cost. As part of our research, we are developing an algorithm which works in a distributed environment with a goal to reduce the overall misclassification cost. Moreover, this will solve the problem of learning from the highly imbalanced dataset as Cost-Sensitive classification is majorly applied in solving class imbalance problem.

 

 

Keywords: Data Science: Cloud Computing, Data Analytics and Machine Learning

School of Engineering and Applied Science

Ahmedabad University
Central Campus
Navrangpura, Ahmedabad 380009
Gujarat, India

[email protected]
+91.79.61911100

  • About Ahmedabad
  • Our Purpose
  • Programmes
  • Admission
  • Research
  • Resources
  • Brochures
  • News
  • Events
  • People
  • Careers
  • Contact

Auris

COPYRIGHT AHMEDABAD UNIVERSITY 2025

CONNECT WITH US

Download Brochure

Please enter information in the form below. The download will start automatically on submission of the form.

Download Brochure

Please enter information in the form below. The download will start automatically on submission of the form.