Avatar

 

Thanh-Nghi Do

A/Prof. @ Dept. of Comp. Networks
WWW: http://www.cit.ctu.edu.vn/~dtnghi
Email: dtnghi@cit.ctu.edu.vn
Address: 3/2 Street, Ninh Kieu District, 94000-CanTho, Viet Nam
Tel: +84 (0)2923 734 720

Education

Dec 2004
  • Ph.D. in Informatics

    Visualization and Support Vector Machine in Data Mining
    LINA, Laboratory for Computer Science, Nantes University, France
    Thesis advisors: Prof. Henri Briand, Dr. François Poulet
Jul 2002
  • Master research in Informatics

    Visualization and Support Vector Machine in Data Mining
    LINA, Laboratory for Computer Science, Nantes University, France
    Advisor: Dr. François Poulet
Aug 2001
  • Master in Informatics

    IFI, Francophone Institute for Computer Science Hanoi, Vietnam
    Advisor: Dr. Philippe Massonet
Jul 1996
  • Bachelor of Engineering in Informatics

    College of Information Technology, Cantho University, Vietnam
    Advisor: Prof. Hoang Kiem

Distinction

Nov. 2015
  • Qualif. for Associate Professor (A/Prof.)

    Information Technology
Jan. 2010
  • Qualif. for Maître de Conférences (MCF-27)

    Informatics
Jan. 2005
  • Qualif. for Maître de Conférences (MCF-27)

    Informatics

Research interests

2001 - present
  • Data mining and Knowledge discovery in databases

    Data mining with SVM and Kernel-based methods, Ensemble methods, Decision tree

    Information visualization in knowledge discovery in databases, Visual data mining

    Mining complex data: very-high-dimensional, large scale, imbalanced datasets

Experience

2012 - 2013
  • Visiting scientist

    DECIDE, URM 6285 Lab-STICC, with A/Prof. Sorin Moga, Telecom-Bretagne, France.
    Automatic Configuration of Enterprise Resource Planning
2008 - 2010
  • Visiting postdoc

    DECIDE, URM 6285 Lab-STICC, with Prof. Philippe Lenca, Telecom-Bretagne, France.
    Decision Trees for Classifying Very-High-Demensional and Imbalanced Data
2006 - 2008
  • Visiting postdoc

    AVIZ, INRIA Saclay, with Prof. Jean-Daniel Fekete, France.
    SEVEN: Visual Analytical Project, Visual Programming Platform for Data Mining
2005 - present
  • Lecturer

    Computer Networks Department, Can Tho University, Vietnam.
    Teaching: Data Mining, Machine Learning, Linux/OSS, Web, Parallel Programming

Publications

  • Journal, book chapter

    O. Et-Targuy, C. Delenne, S. Benferhat, A. Begdouri, T-N. Do, T-T. Ma. From GIS to Graphical Representation for Maintaining Connectivity of Wastewater Network Elements. in SN Computer Science, Vol.5(7): 851, Springer, 2024.

    T-T. Vo, T-N. Do. Improving Chest X-ray Image Classification via the Integration of Self-Supervised Learning and Machine Learning Algorithms. in Journal of Information and Communication Convergence Engineering, Vol.22(2): 165-171, 2024.

    T-H. Nguyen, T-N. Do. Pre-training clustering models to summarize Vietnamese texts. in Vietnam Journal of Computer Science, Vol.xx(xx): xx-yy, World Scientific Publishing, 2024.

    T-N. Do. Enhancing Gene Expression Classification Through Explainable Machine Learning Models. in SN Computer Science, Vol.5(606): 1-116, Springer, 2024.

    T-N. Do, M-T. Tran-Nguyen. ImageNet Classification with Raspberry Pis: Federated Learning Algorithms of Local Classifiers. in Intl Journal of Web Information Systems, Vol.20(1): 48-65, 2024.

    T-N. Do, V-T. Le, T-H. Doan. SVM on Top of Deep Networks for Covid-19 Detection from Chest X-ray Images. in Journal of Information and Communication Convergence Engineering, Vol.20(3): 219-225, 2022.

    T-N. Do. Incremental and Parallel Proximal SVM Algorithm Tailored on the Jetson Nano for the ImageNet Challenge. in Intl Journal of Web Information Systems, Vol.18(2/3): 137-155, 2022.

    T-H. Nguyen, T-N. Do. Text Summarization on Vietnamese Large-scale Dataset. in Journal of Information and Communication Convergence Engineering, Vol.20(4): 309-316, 2022.

    T-N. Do. Training Neural Networks on Top of Support Vector Machine Models for Classifying Fingerprint Images. in SN Computer Science, Vol.2(5), Springer, 2021.

    T-N. Do, T-P. Pham, H-H. Nguyen, N-K. Pham. Visual Classification of Intangible Cultural Heritage Images in the Mekong Delta. Chapter 4 in Data Analytics for Cultural Heritage, Springer, 2021, pp.71-89.

    T-N. Do. Automatic Learning Algorithms for Local Support Vector Machines. in SN Computer Science, Vol.1(1), Springer, 2020.

    M-T. Tran-Nguyen, L-D. Bui, T-N. Do. Decision tree using local support vector regression for large datasets. in Journal of Information & Telecommunication, Vol.4(1): 17-35, Taylor & Francis, 2020.

    P-H. Vo, T-S. Nguyen, V-T. Huynh, T-N. Do. A High capacity invertible steganography algorithm using 2-D histogram shifting with EDH. Chapter 6 in the book Digital Media Steganography: Principles, Algorithms, Advances, ELSEVIER Inc., 2020, pp.99-122.

    P-H. Huynh, V-H. Nguyen, T-N. Do. Improvements in the large p, small n classification issue. in SN Computer Science, Vol.1(4): 1-19, Springer, 2020.

    T-N. Do, F. Poulet. Latent-lSVM classification of very high-dimensional and large scale multi-class datasets. in Concurrency and Computation: Practice and Experience, Vol.31(2):e4224, Wiley, 2019.

    T-N. Do, L-D. Bui. Parallel learning algorithms of local support vector regression for dealing with large datasets. in The LNCS Journal Transactions on Large-Scale Data- and Knowledge-Centered Systems, Vol.41:59-77, Springer, 2019.

    P-H. Huynh, V-H. Nguyen, T-N. Do. Novel hybrid DCNN-SVM model for classifying RNA-Sequencing gene expression data. in Journal of Information & Telecommunication, Vol.3(4): 533-547, Taylor & Francis, 2019.

    P-H. Huynh, V-H. Nguyen, T-N. Do. Enhancing gene expression classification of support vector machines with generative adversarial networks. in Journal of Information and Communication Convergence Engineering, Vol.17(1):14-20, 2019.

    P-H. Vo, T-S. Nguyen, V-T. Huynh and T-N. Do. A Novel Reversible Data Hiding Scheme with Two-Dimensional Histogram Shifting Mechanism. in International Journal of Multimedia Tools and Applications, Vol.77(21): 28777-28797, Springer, 2018.

    T-N. Do, F. Poulet. Parallel learning of local SVM algorithms for classifying large datasets. in The LNCS Journal Transactions on Large-Scale Data- and Knowledge-Centered Systems, Vol.31:67-93, Springer, 2017.

    T-N. Do, P. Lenca, S. Lallich. Classifying Many-Class High Dimensional Fingerprint Datasets Using Random Forest of Oblique Decision Trees. in Vietnam Journal of Computer Science, Vol.2(1): 3-12, Springer, 2015.

    T-N. Do, N-K. Pham. Handwritten Digit Recognition Using GIST Descriptors and Random Oblique Decision Trees. in Advances in Intelligent Systems and Computing, Vol.341: 1-15, Springer, 2015.

    T-N. Doan, T-N. Do, F. Poulet. Large Scale Classifiers for Visual Classification Tasks. in International Journal of Multimedia Tools and Applications, Vol.74(4): 1199-1224, Springer, 2015.

    T-N. Do, H-A. Le-Thi. Massive Classification with Support Vector Machines. in Transactions on Computational Collective Intelligence XVIII, Springer Berlin Heidelberg, 2015, pp. 147-165.

    T-N. Doan, T-N. Do, F. Poulet. Classification d'images à grande échelle avec des SVMs. in Revue Traitement du Signal, Vol.31(1-2): 39-56, LAVOISIER, 2014.

    T-N. Do. Parallel Multiclass Stochastic Gradient Descent Algorithms for Classifying Million Images with Very-High-Dimensional Signatures into Thousands Classes. in Vietnam Journal of Computer Science, Vol.1(2): 107-115, Springer, 2014.

    T-N. Doan, T-N. Do, F. Poulet. Parallel Incremental Power Mean SVM for the Classification of Large Scale Image Datasets. in International Journal of Multimedia Information Retrieval, Vol.3(2): 89-96, Springer, 2014.

    T-N. Do, S. Lallich, N-K. Pham and P. Lenca. Classifying very-high-dimensional data with random forests of oblique decision trees. in Advances in Knowledge Discovery and Management, Studies in Computational Intelligence Vol.292: 39-55, Springer-Verlag, 2010.

    T-N. Do, V-H. Nguyen, F. Poulet. GPU-based parallel SVM algorithm. in Journal of Frontiers of Computer Science and Technology, Vol.3(4): 368-377, 2009.

    T-N. Do and F. Poulet. Interval Data Mining with Kernel-based Algorithms and Visualization. Chapter 5 in Mining Complex Data for Knowledge Discovery: Advances and Applications, Studies in Computational Intelligence Vol.165: 75-91, Springer-Verlag, 2009.

    F. Poulet and T-N. Do. Interactive Decision Tree Construction for Interval and Taxonomical data. in Visual Data Mining: Theory, Techniques and Tools for Visual Analytics, Lecture Notes in Computer Science Vol.4404: 123-135, Springer-Verlag, 2008.

    T-N. Do et J-D. Fekete. V4Miner pour la fouille de données. in Review of Artificial Intelligence, Vol.22/3-4: 503-517, 2008.

    N-K. Pham, T-N. Do, F. Poulet et A. Morin. Tree-view pour l'exploration interactive des arbres de décision. in Review of Artificial Intelligence, Vol.22/3-4: 473-487, 2008.

    T-N. Do and F. Poulet. Vis-SVM : approche coopérative en fouille de données. in Numéro Spécial Visualisation et Extraction de Connaissances, Revue des Nouvelles Technologies de l'Information – Série Extraction et Gestion des Connaissances RNTI-E-7: 49-74, 2006.

    F. Poulet and T-N. Do. Mining Very Large Datasets with Support Vector Machine Algorithms. in Enterprise Information Systems V, Kluwer Academic Publishers, 2004, pp. 177-184.
  • Edited book

    T-N. Nguyen, T-N. Do, S. Benferhat. Second International Conference on ISDS (Intelligent Systems and Data Science) 2024.

    T-N. Nguyen, T-N. Do, P. Haddawy. First International Conference on ISDS (Intelligent Systems and Data Science) 2023.

    P. Lenca, S. Lallich, T-N. Do. QIMIE Workshop (Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models). PAKDD Workshops 2015.

    F. Poulet, B. LeGrand, T-N. Do and M-A. Aufaure. Acte de l'Atelier Visualisation et extraction de connaissances. 9èmes Journées d'Extraction et Gestion des Connaissances 2009.

    F. Poulet, B. LeGrand, T-N. Do. Acte de l'Atelier Visualisation et extraction de connaissances. 8èmes Journées d'Extraction et Gestion des Connaissances 2008.
  • Technical report

    J-D. Fekete, N. Elmqvist, T-N. Do, H. Goodell and N. Henry. Navigating Wikipedia with the Zoomable Adjacency Matrix Explorer. INRIA Research Report, Technical Report No. RR:00141168, 2007.

    T-N. Do and F. Poulet. La catégorisation de textes. Rapport de contrat Fondation Vediorbis. ESIEA Recherche, Laval, 2004.
  • Thesis

    T-N. Do. Visualisation et séparateurs à vaste marge en fouille de données. Thèse de Doctorat de l'Université de Nantes, Décembre 2004.

    T-N. Do. Visualisation et fouille de données. Rapport de DEA, Université de Nantes, Juillet 2002.

Professional Service

  • Workshop Organization

    QIMIE 2015 is organized in association with the PAKDD 2015 conference, with Prof. P. Lenca, Prof. S. Lallich

    IEEE-RIVF 2015 International Conference on Computing and Communication Technologies, Workshop chair

    VisECD 2009 is organized in association with the EGC 2009 conference, with Prof. F. Poulet, Prof. B. Legrand, Prof. M-A. Aufaure

    VisECD 2008 is organized in association with the EGC 2008 conference, with Prof. F. Poulet, Prof. B. Legrand
  • Program committee member, reviewer

    ACML 2017-2019, The Asian Conference on Machine Learning

    IJCAI 2019, The Intl Joint Conferences on Artificial Intelligence

    KDIR 2014-2022, The Intl Conf. on Knowledge Discovery and Information Retrieval

    MCO/ICCSAMA 2014,2015,2021, The Intl Conf. on Computer Science, Applied Math. and Appl.

    FDSE 2014-2022, The Intl Conf. on Future Data and Security Engineering

    ACOMPA 2022, The Intl Conf. on Advanced Computing and Analytics

    SoICT 2018, The Intl Symposium on Information and Communication Technology

    MAPR 2018-2019, The Intl Conf. on Multimedia Analysis and Pattern Recognition

    VAST 2013, The IEEE Conf. on Visual Analytics Science and Technology

    DS 2012, The Intl Conf. on Discovery Science 2012

    ACIIDS 2010-2016, The Asian Conf. on Intelligent Information and Database Systems

    ICTACS 2010-2011, The Intl Conf. on Theories and Applications of Computer Science

    CIE39, The Intl Conf. on Computers & Industrial Engineering 2009

    DMIN 2008-2010, The Intl Conf. on Data Mining

    VIEW 2006-2007, Visual Information Expert Workshop

    AusDM 2004, The Australasian Data Mining Conference 2004

    ASMDA 2005, the Intl Symposium on Applied Stochastic Models and Data Analysis 2005

    Atelier Qualité des Données et des Connaissances 2008-2011

    Atelier Visualisation et extraction de connaissances 2005-2009

    Journal of Intelligent Information Systems 2013

    Journal of Experimental Algorithmics 2009

    Advances in Knowledge Discovery and Management 2009

    Pattern Recognition Elsevier 2008

    RNTI, Revue des Nouvelles Technologies de l'Information, Cépaduès Editions, 2006-2008

    MCO 2008, The Intl Conf. on Modelling, Computation and Optimization in Information Systems and Management Sciences 2008

    I3, Information-Interaction–Intelligence, Cépaduès Editions, 2006

    FAIR 2014-2022, Fundamental And Applied IT Research

    ICTFIT 2008-2012, The National conference in computer science
  • Invited seminars/talks

    National Conference on Fundamental and Applied Information Technology Research, Vietnam, 11/2022

    National Conference on Fundamental and Applied Information Technology Research, Vietnam, 08/2016

    FIT, University of Technology HCM, Vietnam, 03/2014

    LITA Metz, University of Lorraine, France, 12/2012

    Faculty of Information Technology, Dong Thap University, Vietnam, 05/2011

    Faculty of Information Technology, Bac Lieu University, Vietnam, 06/2011

    An Giang University, Vietnam, 08/2010

    Software Center of Cantho University, Vietnam, 04/2005

    IRISA Rennes, France, 01/2005
  • Ph.D. Advisor

    Phuoc-Hung Vo. steganography. Can Tho University, Vietnam, Dec/2020

    Phuoc-Hai Huynh. Gene expression classification. Can Tho University, Vietnam, Sept/2020
  • Ph.D. Defense Committee

    Xuan-Huyen Do. Developing the memory organization for CAPE. University of Hue, Vietnam, Jan/2023

    Hong-Son Trang. Several approaches for solving the personal scheduling problem. University of Technology HCM, Vietnam, Jan/2022

    Thanh-Tuyen Do Thi. Vietnamese document retrieval based on semantics. University of Information Technology HCM, Vietnam, Sept/2020

    Cong-Chien Ta Duy. Building information extraction system based on computing domain ontology. University of Technology HCM, Vietnam, June/2017

    Thanh-Son Nguyen. Time series mining. University of Technology HCM, Vietnam, May/2014

    Emilien Gauthier. Evaluation du risque de maladie: conception d'un processus et d'un système d'information permettant la construction d'un score de risque adapté au contexte, application au cancer du sein. University of Bretagne, France, Jan/2013

Last update Jan 2025