Overview - Document Intelligence

The objective of the Privacy Preserving Federated Learning Document VQA (PFL-DocVQA) competition is to develop privacy-preserving solutions for fine-tuning multi-modal language models for document understanding on distributed data. We seek efficient federated learning solutions for finetuning a pre-trained generic Document Visual Question Answering (DocVQA) model on a new domain, that of invoice processing.

Automatically managing the information of document workflows is a core aspect of business intelligence and process automation. Reasoning over the information extracted from documents fuels subsequent decision-making processes that can directly affect humans, especially in sectors such as finance, legal or insurance. At the same time, documents tend to contain private information, restricting access to them during training. This common scenario requires training large-scale models over private and widely distributed data.

invoicing_scenario.png

 

The participating teams will create methods to train Document Visual Question Answering models on the provided documents with privacy guarantees, using a federated-learning set-up. The competition is structured in 2 different tracks:

  • Track 1 - Federated Learning only: The methods will be trained within a federated learning framework, simulating the need for cooperation between different entities to achieve the best performing model in the most efficient way. Track 1 participant's objective is to reduce the communication used (#bytes), while achieving a comparable performance with the baseline.
  • Track 2 - Federated Learning + Privacy-preserving: In this track, in addition to training over distributed data, we seek to protect the identity of providers that could be exposed to textual (provider company name) or visual (logo, presentation) information. If a malicious competitor (adversary) manages to infer information about a company's providers, it could have a direct impact on the company's business.

 

PFL-DocVQA Workshop at NeurIPS 2023

We will host a half-day workshop at NeurIPS 2023 on Saturday 16, from 7 a.m. to 10 a.m. PST in hybrid format.

  • Competition workshop link

 

Contact information

For any question about this challenge, please contact info_pfl@cvc.uab.cat 

 

Important Dates

November 15, 2023: Winning teams announced.

November 1, 2023: Privacy proof reports due for Track 2 participant teams.

October 27, 2023: End of the competition. Submission data deadline. 

June 30, 2023: Release of training and validation splits.

June 15, 2023: Competition registration opens.