ECML-PKDD LOGO

Applied Data Science Track

The Applied Data Science Track solicits submissions that present compelling applications of Knowledge Discovery, Data Mining, and Machine Learning to solve challenging and important real-world problems, thereby bridging the gap between practice and theory. The submitted papers should clearly explain the specific real-world challenges addressed (encompassing aspects such as size, data quality, and evaluation), the methodology used, and the conclusions and implications that are drawn for the respective use cases.

 

The ADS track aims to accept papers that provide practitioners with valuable insights into how to apply Knowledge Discovery, Data Mining, and Machine Learning in real-world scenarios, showcasing their practical relevance. Alternatively, submissions may highlight novel use cases that contribute to expanding the understanding of the applicability of these methodologies in practical settings. Lastly, the track welcomes papers contributing to the overall knowledge base on the real-world application of Knowledge Discovery, Data Mining, or Machine Learning. The long-term impact on the community is the main criterion considered in the evaluation process.

 

Although not mandatory, authors are encouraged to comment on the deployment goals of their solutions and provide relevant details regarding deployment status and plans. In this context, deployment refers to the practical implementation or application of the proposed solutions in real-world scenarios. The track acknowledges that not all papers may present a deployed solution but emphasises the importance of explicitly discussing deployment in a realistic and transparent way.

 

Regarding methodological novelty, this track does not require the introduction of novel techniques. Instead, the emphasis is placed on the relevance and impact of the applied solutions to real-world challenges, even if proposed systems combine previously known building blocks. Authors are encouraged to focus on their work's practical implications and significance rather than on introducing novel methodologies.

 

Finally, the use of proprietary data is acceptable. However, it is expected that at least some portion of the data be made public, or a public dataset is also used to facilitate transparency and reproducibility.

Key Dates and Deadlines

Submission Site

CMT submission site opens

Feb 15 2024

Abstract submission deadline

Mar 15 2024

Paper Submission Deadline

Mar 22 2024

Author notification

May 27 2024

Camera Ready Submission

Jun 14 2024

*All deadlines expire on 23:59 AoE (UTC - 12)

VIEW OPTIONS

Paper Format

Authorship

Double-blind Reviewing Process

Submission Process

Conference Attendance

Proceedings

Reproducible Research Papers

Ethics Considerations

Authors Commit to Reviewing

Dual Submission Policy

Conflict of Interest

Contact

Paper Format

Papers must be written in English and formatted in LaTeX, following the outline of our author kit. The kit includes a readme document, a LaTeX file template containing author instructions, and style files. The maximum length of papers is 16 pages (including references) in this format. The program chairs reserve the right to reject any over-length papers without review. Papers that ‘cheat’ the page limit by, including but not limited to, using smaller than specified margins or font sizes will also be treated as over-length. Note that, for example, negative vspaces are also not allowed by the formatting guidelines; further details can be found in the author kit. Up to 10 MB of additional materials (e.g., proofs, audio, images, video, data, or source code) can be uploaded with your submission. The reviewers and the program committee reserve the right to judge the paper solely on the basis of the 16 pages of the paper; looking at any additional material is at the discretion of the reviewers and is not required.

Authorship

The author list as submitted with the paper is considered final. No changes to this list may be made after paper submission, either during the review period, or in case of acceptance, at the final camera-ready stage.

Double-blind Reviewing Process

Similarly to previous years, we will apply a double-blind review-process (author identities are not known by reviewers or area chairs; reviewers do see each other’s names). All papers need to be ‘best-effort’ anonymized. Papers must not include identifying information of the authors (names, affiliations, etc.), self-references, or links (e.g., GitHub, YouTube) that reveal the authors’ identities (e.g., references to own work should be given neutrally like other references, not mentioning ‘our previous work’ or similar). We strongly encourage making code and data available anonymously (e.g., in an anonymous Github repository, or Dropbox folder). The authors might have a (non-anonymous) pre-print published online, but it should not be cited in the submitted paper to preserve anonymity. Reviewers will be asked not to search for them. We recognize there are limits to what is feasible with respect to anonymization. For example, if you use data from your own organization and it is relevant to the paper to name this organization, you may do so.

Submission Process

Electronic submissions will be handled via CMT available here. Submissions will be evaluated by three reviewers on the basis of novelty, technical quality, potential impact, and clarity.

Conference Attendance

For each accepted paper, at least one author must register for the main conference and present the paper in person.

Proceedings

The conference proceedings will be published by Springer in the Lecture Notes in Computer Science Series (LNCS).

Reproducible Research Papers

Authors are strongly encouraged to adhere to the best practices of Reproducible Research, by making available data and software tools that would enable others to reproduce the results reported in their papers. We advise the use of standard repository hosting services such as Dataverse, mldata.org, OpenML, figshare, or Zenodo for data sets, and mloss.org, Bitbucket, GitHub, or figshare (where it is possible to assign a DOI) for source code. If data or code gets updated after the paper is published, it is important to enable researchers to access the versions that were used to produce the results reported in the paper. Authors who do not have a preferred repository are advised to consult Springer Nature’s list of recommended repositories and research data policy.

Ethics Considerations

Ethics is one of the most important topics to emerge in Machine Learning and Data Mining. We ask you to think about the ethical implications of your submission – such as those related to the collection and processing of personal data or the inference of personal information, the potential use of your work for policing or the military. You will be asked in the submission form about the ethical implications of your work which will be taken into consideration by the reviewers.

Authors Commit to Reviewing

Authors of submitted papers agree to be potential PC members/reviewers for ECML PKDD 2024 and may be asked to review papers for the conference. This does not apply to authors who are (a) already contributing to ECML PKDD (e.g., accepted a PC/AC invite, are part of the organizing committee) or (b) not qualified to be ECML PKDD PC members (e.g., limited background in ML or DM). This requirement can be waived in a limited range of exceptional cases (e.g., parental leave, long-term illness).

Dual Submission Policy

Papers submitted should report original work. Papers that are identical or substantially similar to papers that have been published or submitted elsewhere may not be submitted to ECML PKDD, and the organizers will reject such papers without review. Authors are also NOT allowed to submit or have submitted their papers elsewhere during the review period. Submitting unpublished technical reports available online (such as on arXiv), or papers presented in workshops without formal proceedings, is allowed, but such reports or presentations should not be cited to preserve anonymity.

Conflict of Interest

During the submission process, you must enter the email domains of all institutions with which you have an institutional conflict of interest. You have an institutional conflict of interest if you are currently employed or have been employed by that institution in the past three years, or you have extensively collaborated with the institution within the past three years. Authors are also required to identify all Program Committee Members and Area Chairs with whom they have a conflict of interest. Examples of conflicts of interest include: co-authorship in the last five years, colleague in the same institution within the last three years, and advisor/student relations (anytime in the past).

Contact

For further information, please contact Mail: ecml-pkdd-2024-applied-data-science-track@googlegroups.com