Algorithm to cluster chart type


NeelKamal Jha

User

( 5 months ago )


I have more than 20,000 entities, A1 to A20000. Each entity has its own time series of data points: A1 has data from year 1 to year 60, A2 from year 5 to year 60, and so on. For each entity we can plot year against value.

My task is to cluster the entities by the shape of their charts. For example, the chart for A1 (assume a bar plot) has a quadratic-like shape, the chart for A2 has an exponential-like shape, and so on. Some entities will have an irregular, scattered-looking chart.

Is there an algorithm for this type of clustering? So far I have written a detector for just one shape, monotonic increase, and I think it works well, but I need automatic shape detection. My method is also not yet robust to fluctuations. For example, in a monotonically increasing series (each year's value is greater than the previous year's), if the value in one year drops by a fairly large amount, the detector fails to classify the series as monotonically increasing even though, generally speaking, it is.
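A rank-based score is one possible way to tolerate such drops. The sketch below is only an illustration, not the detector described above: the function names `trend_score` and `is_mostly_increasing` and the 0.8 threshold are assumptions. It compares every earlier/later pair of values in the Kendall style and ignores ties, so a single bad year lowers the score slightly instead of failing the whole check.

```python
from itertools import combinations


def trend_score(values):
    """Kendall-style score in [-1, 1].

    +1 means every later value is larger than every earlier one,
    -1 means the opposite. Tied pairs are ignored.
    """
    concordant = 0
    discordant = 0
    # combinations() keeps positional order, so `earlier` always precedes `later`.
    for earlier, later in combinations(values, 2):
        if later > earlier:
            concordant += 1
        elif later < earlier:
            discordant += 1
    total = concordant + discordant
    return (concordant - discordant) / total if total else 0.0


def is_mostly_increasing(values, threshold=0.8):
    """Hypothetical rule: call a series increasing if its score is high enough."""
    return trend_score(values) >= threshold
```

With this rule a series such as 1, 2, 3, 4, 5, 2, 7, 8, 9, 10 still counts as increasing despite the drop in the sixth year, while a strict year-over-year comparison would reject it. The threshold would need tuning on real data.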

Any suggestion?

Garry Buttler

User

( 5 months ago )

IMHO, you should run an automated Empirical Orthogonal Function (EOF) analysis and use the corresponding principal components to cluster your entities by "chart" type.

EOF analysis is conceptually similar to Fourier analysis (FA). The main difference is that FA always uses a fixed set of known basis functions (sines and cosines), while EOF analysis determines from the data what the basis functions (the eigenfunctions) should be, along with their relative importance.
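A minimal sketch of how this could look with NumPy, under several assumptions that are not part of the original suggestion: each series is linearly resampled to a common length (because the entities cover different year ranges), each series is z-scored so that only its shape matters, and the EOFs are taken from a singular value decomposition. The function names and the choice of 50 points and 2 modes are hypothetical.

```python
import numpy as np


def resample(series, n_points=50):
    """Linearly interpolate a series of any length onto n_points samples."""
    series = np.asarray(series, dtype=float)
    old_x = np.linspace(0.0, 1.0, len(series))
    new_x = np.linspace(0.0, 1.0, n_points)
    return np.interp(new_x, old_x, series)


def zscore(row):
    """Remove level and scale so that only the shape of the curve remains."""
    centered = row - row.mean()
    std = row.std()
    return centered / std if std > 0 else centered


def eof_analysis(series_list, n_modes=2, n_points=50):
    """Return (eofs, scores, explained) for a list of 1-D series.

    eofs:      n_modes x n_points array, the leading shape patterns
    scores:    n_entities x n_modes array, the principal components
    explained: fraction of variance carried by each returned mode
    """
    X = np.vstack([zscore(resample(s, n_points)) for s in series_list])
    X = X - X.mean(axis=0)  # anomalies relative to the average curve
    _, S, Vt = np.linalg.svd(X, full_matrices=False)
    eofs = Vt[:n_modes]
    scores = X @ eofs.T  # projection of each entity onto each EOF
    explained = (S ** 2 / np.sum(S ** 2))[:n_modes]
    return eofs, scores, explained
```

Each row of `scores` is a short feature vector describing one entity's chart shape. Those rows can then be passed to any standard clustering method (k-means, for example), and `explained` gives a rough guide to how many modes are worth keeping.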

I hope this helps you with this problem.

Kind regards, GEN
