---
license: afl-3.0
---

Multilingual BERT base model (multilingual uncased) trained to predict [CAP issue codes](https://www.comparativeagendas.net/pages/master-codebook).

The model was trained on 120,000 assorted political documents, mostly from the [Comparative Agendas Project](https://www.comparativeagendas.net/).

Countries:
- Italy
- Sweden
- France
- Switzerland
- Poland
- Netherlands
- Germany
- Denmark
- Spain
- UK
- Austria
- Ireland

LABELS USED IN TRAINING

- Model labels -> CAP labels: {0: 1.0, 1: 2.0, 2: 3.0, 3: 4.0, 4: 5.0, 5: 6.0, 6: 7.0, 7: 8.0, 8: 9.0, 9: 10.0, 10: 12.0, 11: 13.0, 12: 14.0, 13: 15.0, 14: 16.0, 15: 17.0, 16: 18.0, 17: 19.0, 18: 20.0, 19: 23.0}
- Model labels -> CAP issues: {0: 'macroeconomics', 1: 'civil_rights', 2: 'healthcare', 3: 'agriculture', 4: 'labour', 5: 'education', 6: 'environment', 7: 'energy', 8: 'immigration', 9: 'transportation', 10: 'law_crime', 11: 'social_welfare', 12: 'housing', 13: 'domestic_commerce', 14: 'defense', 15: 'technology', 16: 'foreign_trade', 17: 'international_affairs', 18: 'government_operations', 19: 'culture'}
- CAP labels -> Model labels: {1.0: 0, 2.0: 1, 3.0: 2, 4.0: 3, 5.0: 4, 6.0: 5, 7.0: 6, 8.0: 7, 9.0: 8, 10.0: 9, 12.0: 10, 13.0: 11, 14.0: 12, 15.0: 13, 16.0: 14, 17.0: 15, 18.0: 16, 19.0: 17, 20.0: 18, 23.0: 19}
- CAP issues -> Model labels: {'macroeconomics': 0, 'civil_rights': 1, 'healthcare': 2, 'agriculture': 3, 'labour': 4, 'education': 5, 'environment': 6, 'energy': 7, 'immigration': 8, 'transportation': 9, 'law_crime': 10, 'social_welfare': 11, 'housing': 12, 'domestic_commerce': 13, 'defense': 14, 'technology': 15, 'foreign_trade': 16, 'international_affairs': 17, 'government_operations': 18, 'culture': 19}
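
The label mappings above can be applied to a prediction roughly as follows. This is a minimal sketch, not the authors' code: it assumes the classifier returns 20 scores ordered by model label (0 to 19), and `decode_prediction` is a hypothetical helper introduced here for illustration. The two dictionaries are copied from the mappings listed above.

```python
# Mappings copied from the model card.
MODEL_TO_CAP_CODE = {
    0: 1.0, 1: 2.0, 2: 3.0, 3: 4.0, 4: 5.0, 5: 6.0, 6: 7.0, 7: 8.0,
    8: 9.0, 9: 10.0, 10: 12.0, 11: 13.0, 12: 14.0, 13: 15.0, 14: 16.0,
    15: 17.0, 16: 18.0, 17: 19.0, 18: 20.0, 19: 23.0,
}
MODEL_TO_CAP_ISSUE = {
    0: 'macroeconomics', 1: 'civil_rights', 2: 'healthcare',
    3: 'agriculture', 4: 'labour', 5: 'education', 6: 'environment',
    7: 'energy', 8: 'immigration', 9: 'transportation', 10: 'law_crime',
    11: 'social_welfare', 12: 'housing', 13: 'domestic_commerce',
    14: 'defense', 15: 'technology', 16: 'foreign_trade',
    17: 'international_affairs', 18: 'government_operations',
    19: 'culture',
}

# The inverse mapping can be derived instead of being maintained by hand.
CAP_CODE_TO_MODEL = {code: label for label, code in MODEL_TO_CAP_CODE.items()}


def decode_prediction(scores):
    """Hypothetical helper: map per-class scores to (model label, CAP code, CAP issue).

    Assumes `scores` holds one score per model label, in label order.
    """
    if len(scores) != len(MODEL_TO_CAP_CODE):
        raise ValueError(f"expected {len(MODEL_TO_CAP_CODE)} scores, got {len(scores)}")
    # Index of the highest score is the predicted model label.
    model_label = max(range(len(scores)), key=scores.__getitem__)
    return model_label, MODEL_TO_CAP_CODE[model_label], MODEL_TO_CAP_ISSUE[model_label]


# Made-up scores for illustration: class 10 has the highest score.
example_scores = [0.0] * 20
example_scores[10] = 3.2
print(decode_prediction(example_scores))  # (10, 12.0, 'law_crime')
```

Note that CAP codes 11, 21 and 22 do not appear in the mappings, so the model label and the CAP code are not interchangeable; always convert through the dictionaries.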