NOTE - Sponsor reserves the right to award fewer than the advertised number of prizes in the event an insufficient number of submissions/solutions are received that are of reasonable quality to merit a prize.
At Wells Fargo, our data scientists play a key role in driving innovative and meaningful insights that enable our lines of business to provide a world-class experience to our stakeholders. The Campus Analytics Challenge 2022 (“Challenge”) puts you in the role of a data scientist and calls you to use Machine Learning and Natural Language Processing to predict transaction categories. The dataset is small enough that you should be able to work with it on a standard laptop.
To help get your creative juices flowing, we encourage you to explore Machine Learning and Natural Language Processing research, literature and beyond, as you may find a creative approach in other sub-fields of data science.
Challenge Background: It is no surprise that financial companies need to help their customers organize their finances. Customers want to know what they spend their money on to keep balances in check. By categorizing transactions and building better customer engagement tools, Wells Fargo can help customers identify frequent purchases and subscriptions, sort income and activity liability with higher accuracy, and reduce credit risks.
Transaction categorization is the ability to recognize the purpose of a transaction based on its description. For long, this process was done manually but now technology can do it efficiently.
This Challenge will focus on Natural Language Processing using the power of Machine Learning to predict which category a transaction will fall into, given the description of the transaction.
12:00:01 p.m. Eastern Time (“ET”) on 06/13/2022
Challenge Submission Deadline
12:00:01 p.m. ET on 07/13/2022
07/14/2022 – 08/05/2022
Potential Finalists & Winners Notified
08/10/2022 (on or about)
Challenge Objective: This Challenge will focus on Natural Language Processing using the power of Machine Learning to predict which category a transaction will fall into, given the description of the transaction. Your solution must meet:
- The Challenge Criteria
- Follow the Challenge Instructions and Requirements
- Incorporate the Key Deliverables, each described in detail below.
Eligibility: This Challenge is sponsored by Wells Fargo Bank, N.A. (“Sponsor” or “Wells Fargo”) for full-time or part-time students, 18 years of age or older at the time of entry, who are enrolled in any higher education degree program on campus or online at colleges or universities in the United States and District of Columbia, including students attending two- and four-year programs, technical and vocational schools, junior and community colleges, as well as graduate and professional education students (collectively “Students”).
Employees of Wells Fargo or MindSumo, Inc. and their respective parents, divisions, affiliates, subsidiaries, their promotional or marketing agencies, government entities and public officials, and their immediate family members (parent, child, sibling and spouse) and persons living in the same households of each such employee (whether related or not) are not eligible. To be eligible to receive any prize, potential winners must have a valid U.S. tax identification number and meet all the eligibility requirements at the time the prize is awarded. Potential winners will be required to provide Sponsor with proof that they meet the eligibility requirements for this Challenge. Void where prohibited by law.
Challenge Criteria: Build a model to predict transaction categories using the 10 (ten) distinct categories that a transaction may fall into. The categories are as follows:
• Communication Services
• Health and Community Service
• Property and Business Services
• Retail Trade
• Services to Transport
• Trade, Professional and Personal Services
The dataset provided on the Challenge page is synthetic. This dataset has been scrubbed to replace all transaction numbers with 1's.
Training Dataset – The Training Dataset contains 40,000 unique transactions and their corresponding transaction categories. This dataset should be used to develop your solution.
Test Dataset – The Test Dataset contains 10,000 unique transactions with the transaction categories omitted. This dataset should be used to test your solution and identify the correct transaction categories.
Challenge Instructions and Requirements: When creating your Solution, you may use a novel combination of existing Machine Learning and/or Natural Language Processing or develop your own novel method in order to extract and/or represent thematic information from the data file.
You must provide citations and sources for any additional data and/or methodologies used.
Deliverable 1) Describe, using an abstract, your approach and methodology. Include a visual representation of your analytic process flow.
Deliverable 2) Return the uncategorized content dataset with the categories identified.
Deliverable 3) Document your code and reference the analytic process flow-diagram from deliverable one (1).
Submitting as a Team: You are welcome to work as a team on the Campus Analytics Challenge. However, you should only provide ONE submission for the entire team.