This study proposes the mechanism to collect real time data relating to education sector into a centralize repository and then to process specific portion of collected data using machine learning algorithms to compare performance of public and private sector higher education systems in Pakistan. Data of selected batch of students from colleges and universities are interlinked through a central repository that organizes and processes data to monitor the overall performance in terms of student retention gives trends of admissions and drop outs. A framework to that will facilitate the stakeholders including Government, Students and institutions to overview the progress at any level and can be sliced to any institution as well.
Keywords : Machine Learning, Repository, Distributed Systems, Data Sharing, Algorithms.