16May

Data Scientist – Arquitectura Contable y Financial


Arquitectura Contable y Financial Business Intelligence (FBI)

¿QUÉ PROYECTOS DESARROLLAMOS?

 Somos un área donde los datos juegan un papel relevante, debemos destacar la importancia de tener robustez en los procesos, con información fiable y en tiempo. Esto nos permitirá seguir proporcionando servicio, tanto a supervisores como a la alta dirección, en un alto grado de calidad.

Hemos lanzado distintas iniciativas con el área de Medios que van encaminadas a iniciar un proceso de transformación en cada uno de los ámbitos que consideramos estratégicos: Operacional – Arquitectura Contable – Datos y Usos – Herramientas de procesos y reporting.

Como equipo impulsamos la digitalización mediante la optimización de procesos, herramientas tecnológicas e implantamos nuevos métodos de trabajo mediante proyectos. Gestionamos la interlocución con el área de Medios y centralizamos el presupuesto QBR de nuestra área y sus iniciativas asociadas.

Los proyectos en los que paticiparas en la posición son:

  • Implantar proyectos de soporte en el área de Contabilidad y Reporting Legal Integrado
  • Participar en el cierre contable individual y consolidado desde un punto de vista de proceso y herramientas de Caixabank y del Grupo.
  • Dar soporte al resto del área en materia de datos y herramientas, mediante extracciones de información, generación de Cuadros de Mando (CdM) y soluciones que permitan la toma de decisión de los procesos más relevantes.
  • Implantar nuevas maneras de trabajar en el área:
    • Analizando y optimizando procesos
  • Desarrollar Proyectos de Automatización mediante Robotics | Macros
  • Generar la definición de Nuevas herramientas
  • Participar activamente en el plan de transformación del área
  • Creación de valor en proyectos de
    • Visualización avanzada mediante Qlik
    • Analítica avanzada – Modelos – Python
  • Role de facilitador en la gestión del cambio en el área



Source link

16May

CMU Researchers Propose MOMENT: A Family of Open-Source Machine Learning Foundation Models for General-Purpose Time Series Analysis


Pre-training large models on time series data faces several challenges: the lack of a comprehensive public time series repository, the complexity of diverse time series characteristics, and the infancy of experimental benchmarks for model evaluation, especially under resource-constrained and minimally supervised scenarios. Despite these hurdles, time series analysis remains vital across applications like weather forecasting, heart rate irregularity detection, and anomaly identification in software deployments. Utilizing pre-trained language, vision, and video models offers promise, though adaptation to time series data specifics is necessary for optimal performance.

Applying transformers to time series analysis presents challenges due to the quadratic growth of the self-attention mechanism with input token size. Treating time series sub-sequences as tokens enhances efficiency and effectiveness in forecasting. Utilizing cross-modal transfer learning from language models, ORCA extends pre-trained models to diverse modalities through align-then-refine fine-tuning. Recent studies have utilized this approach to reprogram language pre-trained transformers for time series analysis, albeit resource-intensive models require substantial memory and computational resources for optimal performance.

Researchers from Carnegie Mellon University and the University of Pennsylvania present MOMENT, an open-source family of foundation models for general-purpose time series analysis. It utilizes the Time series Pile, a diverse collection of public time series, to address time series-specific challenges and enable large-scale multi-dataset pretraining. These high-capacity transformer models are pre-trained using a masked time series prediction task on extensive data from various domains, offering versatility and robustness in tackling diverse time series analysis tasks.

MOMENT begins by assembling a diverse collection of public time series data called the Time Series Pile, combining datasets from various repositories to address the scarcity of comprehensive time-series datasets. These datasets encompass long-horizon forecasting, short-horizon forecasting, classification, and anomaly detection tasks. MOMENT’s architecture involves a transformer encoder and a lightweight reconstruction head pre-trained on a masked time series prediction task. The pre-training setup includes variations of MOMENT corresponding to different sizes of encoders, trained with Adam optimizer and gradient checkpointing for memory optimization. MOMENT is designed for fine-tuning downstream tasks such as forecasting, classification, anomaly detection, and imputation, either end-to-end or with linear probing, depending on the task requirements.

The study compares MOMENT with state-of-the-art deep learning and statistical machine learning models across various tasks, contrary to TimesNet, which mainly focuses on transformer-based approaches. These comparisons are essential for evaluating the practical applicability of the proposed methods. Interestingly, statistical and non-transformer-based methods, such as ARIMA for short-horizon forecasting, N-BEATS for long-horizon forecasting, and k-nearest neighbors for anomaly detection, demonstrate superior performance over many deep learning and transformer-based models.

To recapitulate, this research presents MOMENT, the first open-source family of time series foundation models developed through comprehensive stages of data compilation, model pre-training, and systematic addressing of time series-specific challenges. By utilizing the Time Series Pile and innovative strategies, MOMENT demonstrates high performance in pre-training transformer models of various sizes. Also, the study designs an experimental benchmark for evaluating time series foundation models across multiple practical tasks, particularly emphasizing scenarios with limited computational resources and supervision. MOMENT exhibits effectiveness across various tasks, showcasing superior performance, especially in anomaly detection and classification, attributed to its pre-training. The research also underscores the viability of smaller statistical and shallower deep learning methods across many tasks. Ultimately, the study aims to advance open science by releasing the Time Series Pile, along with code, model weights, and training logs, fostering collaboration and further advancements in time series analysis.


Check out the Paper and GitHub. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our Telegram Channel, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our 42k+ ML SubReddit


Asjad is an intern consultant at Marktechpost. He is persuing B.Tech in mechanical engineering at the Indian Institute of Technology, Kharagpur. Asjad is a Machine learning and deep learning enthusiast who is always researching the applications of machine learning in healthcare.






Source link

15May

Marker: A New Python-based Library that Converts PDF to Markdown Quickly and Accurately


The need to convert PDF documents into more manageable and editable formats like markdowns is increasingly vital, especially for those dealing with academic and scientific materials. These PDFs often contain complex elements such as multi-language text, tables, code blocks, and mathematical equations. The primary challenge in converting these documents lies in accurately maintaining the original layout, formatting, and content, which standard text converters often need help to handle.

There are already some solutions available aimed at extracting text from PDFs. Optical Character Recognition (OCR) tools are commonly used to interpret and digitize the text contained within these files. However, while these tools can handle straightforward text extraction, they frequently need to improve when preserving the intricate layouts of academic and scientific documents. Issues such as misaligned tables, misplaced text fragments, and loss of critical formatting are commonplace, leading to outputs that require significant manual correction to be helpful.

In response to these challenges, a new tool called “Marker” has been developed that significantly enhances the accuracy and utility of converting PDFs into markdown. Marker is designed to tackle the complexities of high-density information documents like books and research papers. It supports extensive document types and is optimized for content in any language. Crucially, Marker not only extracts text but also carefully maintains the structure and formatting of the original PDF, including accurately converting tables, code blocks, and most mathematical equations into LaTeX format. Additionally, Marker can extract images from the documents and integrate them appropriately into the resultant markdown files.

It has been finely tuned to efficiently handle large volumes of data, utilizing GPU, CPU, or MPS platforms to optimize processing speed and accuracy. This capability ensures that it operates within a reasonable usage of computational resources, typically requiring around 4GB of VRAM, which is on par with other high-performance document conversion tools. Benchmarks comparing Marker to existing solutions highlight its superior ability to maintain the integrity and layout of complex document formats while ensuring the converted text remains true to the original content.

Further setting Marker apart is its tailored approach to handling different types of PDFs. It is particularly effective with digital PDFs, where the need for OCR is minimized, thus allowing for faster and more accurate conversions. The developers have acknowledged some limitations, such as the occasional imperfect conversion of equations to LaTeX and minor issues with table formatting. 

In conclusion, Marker represents a significant step forward in document conversion technology. It addresses the critical challenges faced by users who need to manage complex documents by providing a solution that not only converts text but also respects and reproduces the original formatting and structure. With its robust performance metrics and adaptability to various document types and languages, Marker is poised to become an essential resource for academics, researchers, and anyone involved in extensive document handling. As digital content grows both in volume and complexity, having reliable tools to facilitate easy and accurate conversion will be paramount.


Niharika is a Technical consulting intern at Marktechpost. She is a third year undergraduate, currently pursuing her B.Tech from Indian Institute of Technology(IIT), Kharagpur. She is a highly enthusiastic individual with a keen interest in Machine learning, Data science and AI and an avid reader of the latest developments in these fields.




Source link

14May

OpenAI Launches ChatGPT Desktop App: Enhancing Productivity for Mac Users


On May 13, OpenAI held its Spring update event, at which the company announced its newest model, GPT-4o, an AI model with a GPT-4 level of intelligence. The “o” in GPT-4o means omnimodal capabilities due to its ability to process and integrate text, vision, and audio. The event was overall good and properly highlighted everything important and relevant, including the major announcement of the official ChatGPT desktop app for Mac.

Before we elaborate on the new ChatGPT Mac desktop app, here are a few worth mentioning:

  • The event started and ended with positivity, and the first big news was that ChatGPT has over 100 million users worldwide. 
  • The most exciting announcement was the introduction of its new model, GPT-4o. This new model is accessible to all. It will be available for free to all ChatGPT users.
  • The new GPT-4o can better understand and respond to voice commands, allowing you to interrupt between the responses to change the topic or tone or get a better response output. 
  • The new and improved vision capability can see what’s going around you and respond to you accordingly, and even help you solve complex questions. 
  • The new model can even assist you with coding in real-time just by using the desktop app by highlighting and pressing cmd + C. But for now, this is only available on the Macs.
  • Lastly, the new model’s speech translation ability was one of its highlights because of its speed and lack of awkward lags in between.

The most shocking and interesting news was the ChatGPT desktop application for Macs. It surprised a few people as Microsoft had invested over $13 billion in OpenAI. Yet, OpenAI came up with the Mac application first, possibly due to the highly rumored Apple and OpenAI deal, which is said to be close to being a done deal.

As suggested, the Mac application is only available to macOS users and requires macOS 14.0 or later. Although the news is big, not everyone is happy because, as stated before, investors, like Microsoft, expressed disappointment that the first desktop app release was for macOS and not Windows. However, OpenAI has assured that they plan to launch a Windows version later this year, as most of their users are Windows users.

The ChatGPT app can also live and stay pinned in your taskbar for quick and easy access. 

The Mac application can monitor your screen and appear on any opened tab. 

You can easily share any picture or screenshot with it using the drag-and-drop option.

Within seconds, you will receive a response related to the image.

Finally, the app can communicate with you like an assistant and present you with appropriate answers to your queries.

In Conclusion:

The OpenAI Spring update event was quite eventful and exciting. The introduction of the new GPT-4o model with its omnimodal capabilities is particularly intriguing. Additionally, the ChatGPT desktop app announcement for Mac is a significant development. Some are disappointed by the lack of a Windows version, but that will not last long. Overall, it’s clear that OpenAI is making impressive steps in AI technology. Now we wait and hope to get GPT-5 later this year!


Nishant, the Product Growth Manager at Marktechpost, is interested in learning about artificial intelligence (AI), what it can do, and its development. His passion for trying something new and giving it a creative twist helps him intersect marketing with tech. He is assisting the company in leading toward growth and market recognition.




Source link

14May

Machine Learning Engineer – Ads (Off-Platform) (Remote


Are you intrigued by data? Yelp has hundreds of millions of pieces of user-contributed content, millions of users and business listings, and hundreds of thousands of advertising customers — and all of these numbers are constantly growing.

 

As a Machine Learning Engineer within the Ads group, you will be joining a team that is responsible for optimizing the delivery of ads outside of the Yelp platform. In this role you will turn raw data into valuable signals by building elegant, scalable systems that use data warehouses, batch processing, and stream processing solutions.

 

Yelp engineering culture is driven by our values: we’re a cooperative team that values individual authenticity and encourages creative solutions to problems. All new engineers deploy working code their first week, and we strive to broaden individual impact with support from managers, mentors, and teams. At the end of the day, we’re all about helping our users, growing as engineers, and having fun in a collaborative environment.

 

This opportunity requires you to be located in the Republic of Ireland. We’d love to have you apply, even if you don’t feel you meet every single requirement in this posting. At Yelp, we’re looking for great people, not just those who simply check off all the boxes.



Source link

13May

Chief Information Security Officer at Pinsent Masons


Job Title:  Chief Information Security Officer

Location: Birmingham or London

Hours of work: 9.30am–5.30pm (Variable), Monday to Friday. Some out of hours & weekend work (flexibility and travel required).

Reports to: Chief Technology Officer; Dotted line to Chief Operating Officer

About the Business: 

Here at Pinsent Masons we bring together the best people to get the job done. We’re naturally curious, constantly learning, listening, and growing. We’ll truly value your ideas. You’ll be joining an award-winning, hardworking and commercially minded team, where you’ll have the opportunity to work with leading experts and form meaningful relationships, while making a difference. You’ll get the opportunity to be involved in varied and challenging work. Working in an open and supportive environment, to deliver outstanding results. 

Purpose of the role: 

This role is responsible for the overall security posture of the organisation, ensuring the organisation’s information and technology assets are protected from internal and external threats.

The Chief Information Security Officer’s role is to provide vision and leadership for developing, implementing and supporting the firm’s cyber security strategy, and owning the delivery and operations of the programme of work.

The Chief Information Security Officer owns the planning and implementation of the cyber security programme, and ensures delivery in conjunction with peers, subject matter experts and business partners.

This individual is also responsible for ensuring compliance with all regulatory requirements and implementing and enforcing all security policies and procedures.

Candidate Overview:

We are looking for candidates who ideally hold the following skills and experience: 

  • At least 5 years in a senior leadership role in information / cyber security
  • Knowledge of common cyber security/ information security management frameworks including ISO 27001, NIST, Cyber Essentials, CIS
  • Wide ranging knowledge and experience of cyber security technologies, strategies, and information security risk management
  • Experience in the legal industry is highly desirable, with an understanding of the unique challenges and regulatory requirements.
  • Knowledge of legal and regulatory requirements related to cyber security and how these differ by jurisdiction.
  • Experience of developing and maintaining policies and procedures related to end-to-end cyber security management.
  • Experience of managing incidence response teams
  • Proven experience with creating, developing and leading teams.
  • Experience with building relationships at all levels, with internal and external stakeholders and business partners.
  • Experience of managing risks and issues, involving the right experts and individuals at the right time.

Qualification: 

  • Industry recognized certifications in Information Security (e.g. CISSP, CISM, CISA).

What can we offer you?

  • Agile working (the opportunity to work from home, subject to  commitments)
  • Carers’ leave (up to five paid days’ leave towards caring responsibilities) 
  • 25 days’ annual leave entitlement and the opportunity to purchase or roll over 5 days.
  • Contributory pension of up to 5%.
  • Private healthcare policy 
  • Death in service cover (4 x base salary).
  • Cycle to work scheme.

What happens next? 

Once your application has been submitted and reviewed, our Recruitment team will share the outcome with you by email. 

We typically hold two interview stages per vacancy providing the opportunity to meet two members of the hiring team at each stage. The first stage is typically conducted virtually and the second stage typically in person at the office in which the role would be based. However, we strive to remain flexible depending on the requirements of the role or the candidate. 

Our strength lies in our differences.

We are a Disability Confident and top Stonewall employer, a Valuable 500 member, a founding member of the Mindful Business Charter, signatory of the Race at Work Charter and a proud partner of Neurodiversity in Law. We encourage and value different ideas and styles of thinking. It’s with different perspectives that we’ll find solutions to our clients’ most complex challenges. It’s how we’ll deliver outstanding results today, and tomorrow. We want everybody attending an interview to be comfortable and able to fully demonstrate their experience and talents. 

#LI-BOSD



Source link

13May

MISATO: A Machine Learning Dataset of Protein-Ligand Complexes for Structure-based Drug Discovery


In the dynamic field of AI technology, a pressing challenge for the drug discovery (DD) community, especially in structural biology and computational chemistry, is the creation of innovative models finely tuned for drug design. The core challenge lies in accurately and efficiently predicting molecular properties crucial for understanding protein-ligand interactions and optimizing binding affinities, essential for advancing effective drug development initiatives.

In current structural biology and drug design, researchers commonly depend on existing datasets and methods, which have inherent limitations like structural inaccuracies, crystallographic artifacts, and difficulties in accurately capturing the dynamic nature of protein-ligand interactions. Traditional approaches for predicting molecular properties often lack the necessary detail for complex protein-ligand interactions, neglecting the vital role of dynamics and flexibility in understanding binding mechanisms and affinity.

Researchers from the Institute of Structural Biology, Technical University of Munich, Jülich Supercomputing Centre, Helmholtz AI, Cambridge University, Jagiellonian University, and Institute of Computational Biology propose MISATO, marking a transformative shift in drug discovery and structural biology methodologies. MISATO addresses the limitations of existing methods by integrating quantum-chemically refined ligand data, molecular dynamics (MD) simulations, and advanced AI models. This comprehensive approach facilitates a nuanced understanding of molecular properties, capturing electronic structure details and dynamic behavior crucial for accurate predictions. 

MISATO takes a comprehensive approach, utilizing semi-empirical quantum chemical methods to refine ligand datasets. This method captures electronic properties with high accuracy, while also analyzing both electronic structure details and dynamic behavior, crucial for precise predictions. Additionally, classical MD simulations within MISATO characterize the dynamic behavior and conformational landscape of protein-ligand complexes, offering insights into binding mechanisms and flexibility. AI models integrated into MISATO, such as graph neural networks (GNNs), are trained on this enriched dataset to predict properties like adaptability, binding affinities, and thermodynamic parameters. Extensive experimental validations confirm the efficacy of these models in accurately predicting key molecular properties crucial for drug discovery.

In conclusion, MISATO signifies a key stride in AI-driven drug discovery and structural biology. By integrating quantum chemistry, MD simulations, and advanced AI models, MISATO provides a holistic and robust solution to challenges in structure-based drug design, enhancing accuracy and efficiency and empowering researchers with potent tools.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our Telegram Channel, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our 42k+ ML SubReddit


Aswin AK is a consulting intern at MarkTechPost. He is pursuing his Dual Degree at the Indian Institute of Technology, Kharagpur. He is passionate about data science and machine learning, bringing a strong academic background and hands-on experience in solving real-life cross-domain challenges.






Source link

13May

Head of Data Science, LTV at Trainline


We are champions of rail, inspired to build a greener, more sustainable future of travel. Our purpose is our momentum. It makes us feel good because we know we’re doing good. As we lead the way to a greener future, we do it together. We’re all about connections – with each other, with our customers and with the world. Just as our platform brings the world together, it’s our ambition that connects us. We motivate each other to go beyond our limits, to experiment, to fail and to always grow. 

With over 110 million visits every month to our platform and £4.3 billion in net ticket sales, we’re always innovating and making moves towards our final destination — a world where travel is as simple, seamless, and affordable as it should be. 

And we couldn’t do any of it without our incredible people driving us forward. Today, we’re a FTSE 250 company that’s proudly home to more than 1000 Trainliners from over 60 nationalities across offices in London, Paris, Barcelona, Milan, Edinburgh, Berlin, Madrid and Brussels. It’s this diversity that energises us and makes us stronger, helping us to achieve amazing things. 

With our sights firmly set on further European growth, there is no better time to jump on board this high-speed train and be part of our continued success. 

Great journeys start with Trainline. 

Introducing Data Science at Trainline👋  

Data Science is central to how we build products, delight our customers and grow our business. Our Data Scientists are embedded in cross-functional teams which exist across product and marketing.  Data Scientists have a high degree of autonomy and are empowered to drive the success of their teams, using all data and techniques at their disposal.      

As a Head of Data Science you will manage a team of highly functioning team of Data Scientists and Data Science Managers in the Growth pillar of Trainline. You will also be the joint lead of this cross functional pillar which comprises of ~150 people across Product, Engineering, Marketing and broader data functions.  

The pillar itself is focused on driving the growth engine of trainline, with a large Marketing footprint across all major marketing channels operating in a heavily data driven way to acquire users, a portion of the team focused on driving up retention and LTV through marketing, CRM and onsite activities and portion of the team focused on driving up revenue per user through partnerships and advertising. Getting this growth loop working well is the key to Trainline’s success and this therefore represents an exciting opportunity in the team. 

As a Data Science leader you are responsible for ensuring that your team is contributing to the short and long term decision making in all parts of the Growth pillar as they embed in teams, this could range from building out attribution models, LTV predictions, incrementality testing etc as well as guiding each team to set ambitious north star strategies and working towards these in a data driven way. 

As a leader of the Growth pillar you will work with your Product, Marketing, Engineering and Commercial counterparts to set out the long term strategy for growth at Trainline and ensure we execute on this within the team whilst being responsible for communicating this vision and progress to it to senior leadership in the company. You will also have a close working relationship with Data Engineering, Machine Learning and BI teams as the data representative in the Pillar leads group to ensure all of data is working on the right problems to move Trainline forwards. 

Management experience is required, as is a marketing experience with a history of driving growth using multiple levers and a strong history of being able to drive decisions with data and history of management. 

Data science at Trainline exists within the wider data organisation as part of the tech org, and is complemented by data engineering teams, data platform teams, and ML teams for when deep ML and AI techniques are required. Our autonomous model creates a huge opportunity for personal influence and impact – as the data scientist on the team you will be actively driving innovation on the team by contributing to strategy, execution and continuous learning.  

As a Head of Data Science at Trainline, you will…🚄  

Be responsible for influencing product, marketing and business outcomes, have the autonomy to make things happen and must obsess about having business impact. More specifically you will: 

  • Lead a team of ~12 Embedded DS and Managers 
  • Drive a high standard of work and hold a high bar for impact within your org 
  • Mature how we achieve growth in a data driven way across your team 
  • Lead, with your cross functional counterparts, the strategy and delivery of the Growth pillar of Trainline of ~130 people 
  • Think big, clearly setting out a strategy and ensuring data driven execution 

We’d love to hear from you if you have…🔍  

  • Proven experience in leading data driven teams in the growth and marketing space  
  • Experience managing a data driven team and holding a high bar for analysis for 3+ years 
  • Experience in driving growth in an online product for 6+ years 
  • Experience setting the strategic direction and thinking big 
  • Ability to distil and communicate results of complex analysis clearly and effectively to all levels including senior management  
  • Experience of marketing evaluation and measurement of success. For example, running holdout/incrementality testing to evaluate campaign effectiveness or deploying new bidding models and understanding their impact  
  • Ability to navigate data sets of varying complexity/ambiguity and conduct analysis to derive clear insights and actionable results  
  • Strong PowerPoint and presentation/communication skills  
  • Strong data visualisation skills using tools like Tableau, Spotfire, Power BI etc. 
  • Knowledge of statistical techniques like econometric modelling (desired) 
  • Tech Stack: SQL, Python, R, Tableau, Power BI, AWS Athena + More!  

Why should you jump on board? 

We pay special attention to learning and development and organise quarterly company learning days as well as offering a learning budget that can be put towards resources of your choice. We will cover the costs of your professional subscriptions and give you access to our very own learning platform. 

At Trainline, we care about the wellness of our employees. We host puppy therapy sessions, in-office yoga and run Mental Health First Aider training courses as well as having an Employee Assistance Program as one of our many company benefits.  

We regularly throw fun social events such pub quizzes, karaoke nights and our large-scale Summer and Winter Festivals every year. Additionally, we love hosting meetups in our amazing event spaces and having the opportunity to support internal and external community groups.  

We also hold companywide hackathons and our annual Trainline Tech Summit, which provides Trainliners with an opportunity to stand up and share their story, learnings, or new skills with their colleagues in a safe environment. 

Our flexi-first approach

We believe in the importance of a healthy work-life balance and the value of a flexible workforce. Our flexi-first approach outlines our commitment to a hybrid way of working and our expectations of Trainliners. A key part of what makes Trainline special is our people and the value we get from the buzz and energy of our workplaces, and that’s why we’re proud to offer the best of both worlds. In practice this means in–office attendance at least 40% of the time over a 12-week period for all Trainliners. These in-office days are typically team led to help us connect, collaborate and create together.  

Our Values 

  • Think Big – We’re building the future of rail 
  • Own It – We care about every customer, partner and journey 
  • Do Good – We make a positive impact 
  • Travel Together – We’re one team 

Interested in finding out more about what it’s like to work at Trainline? Why not check out what our employees say about us on Glassdoor? You can also find out more information by following us on LinkedIn or our ‘Life at Trainline’ Instagram account.  

We value open expression at Trainline, we believe it’s the diversity of experience, backgrounds and perspectives of our employees that makes us who we are. We encourage everybody to play a part in changing the way people travel across the world. 





Source link

13May

How ‘Chain of Thought’ Makes Transformers Smarter


Large Language Models (LLMs) like GPT-3 and ChatGPT exhibit exceptional capabilities in complex reasoning tasks such as mathematical problem-solving and code generation, far surpassing standard supervised machine learning techniques. The key to unlocking these advanced reasoning abilities lies in the chain of thought (CoT), which refers to the ability of the model to generate intermediate reasoning steps before arriving at the final answer, kind of like how we humans break down a complex problem into smaller steps in our head. This can be achieved through methods like training the model on examples enriched with intermediate reasoning steps or using few-shot prompting to instruct the model to generate a CoT.

Now, you might think that the contents of these intermediate steps is what allows the model to reason better. But interestingly, in this study, the researchers found that even if the intermediate steps are incorrect or completely random, just the act of generating them still helps the model a lot. It’s like the model is being told “Okay, think this through step-by-step” and that alone improves its reasoning ability drastically.

So the researchers wanted to understand why this “chain of thought” approach is so powerful for transformers (the type of model used in GPT-3, etc). They used concepts from circuit complexity theory and adopted the language of computational complexity classes like NC, AC, and TC to analyze this problem.

Essentially, they found that without the chain of thought, transformers are limited to efficiently performing only parallel computations, meaning they can solve problems that can be broken down into independent sub-tasks that can be computed simultaneously.

However, many complex reasoning tasks require inherently serial computations, where one step follows from the previous step. And this is where the chain of thought helps transformers a lot. By generating step-by-step reasoning, the model can perform many more serial computations than it could without CoT.

The researchers proved theoretically that while a basic transformer without CoT can only solve problems up to a certain complexity level, allowing a polynomial number of CoT steps makes transformers powerful enough to solve almost any computationally hard problem, at least from a theoretical perspective.

To back up their theory, they also did some experiments on different arithmetic tasks – ones that can be parallelized and ones that inherently require sequential computations. Sure enough, they found that transformers struggled on the sequential tasks without CoT, but enabling CoT drastically boosted their performance, especially when the transformer model was relatively small/shallow.

In essence, the chain of thought is a simple but powerful trick that vastly increases the reasoning capabilities of transformer models like GPT-3. It allows them to tackle complex tasks requiring sequential logic that parallel models would fail at. 


Check out the PaperAll credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our Telegram Channel, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our 42k+ ML SubReddit


Vineet Kumar is a consulting intern at MarktechPost. He is currently pursuing his BS from the Indian Institute of Technology(IIT), Kanpur. He is a Machine Learning enthusiast. He is passionate about research and the latest advancements in Deep Learning, Computer Vision, and related fields.






Source link

12May

Chief Information Security Officer at Veolia


Veolia Group aims to be the benchmark company for ecological transformation. With nearly 220,000 employees worldwide, the Group designs and provides game-changing solutions that are both useful and practical for water, waste and energy management. Through its three complementary business activities, Veolia helps to develop access to resources, preserve available resources and replenish them. In 2021, the Veolia group provided 79 million inhabitants with drinking water and 61 million with sanitation, produced nearly 48 million megawatt hours and recovered 48 million tonnes of waste. Veolia Environment (Paris Euronext: VIE) achieved consolidated revenue of 28,508 billion euros in 2021. www.veolia.com

Position Purpose

Implement and maintain an information security program covering the entire organization. Evaluates risks, threats and consequences in order to establish an appropriate prevention plan. Establish policies and standards as necessary for governance of the information security program. Provide an advisory role, support, information, training, and alert to other departments.

 

Primary Duties / Responsibilities

  1. Lead the Enterprise Information Security Group. Drive the design and execution of the information security strategy, work in partnership with various key stakeholders (Risk Management, Technology, Legal, Human Resources, Lines of Business Management, etc.)

  2. Serve as the senior spokesperson for information security, including communicating key issues, risks, and progress to governance committees, business executives, Regulators, and the Board of Directors.

  3. Build and Lead the Information Security Steering Committee.

  4. Monitor and measure progress and highlight/escalate issues.

  5. Build, retain and develop a team of top cyber security talent.

  6. Design and operate a Security Operations Center to promptly identify and respond to security issues/anomalies. Execute and maintain response processes to ensure timely response to detected cybersecurity events. Contain and mitigate incidents and newly identified vulnerabilities.

  7. Build and run a risk assessment program that includes comprehensive technical assessments of applications and infrastructure, penetration tests, and security architecture assessments. Ensure the provision of data security subject matter expertise to project teams to ensure early identification of data security requirements. Categorize and prioritize assessment risks for remediation.

  • Design and run an information security metrics/reporting program. In addition, produce information security reports as required, including Regulatory reports.

  • Ensure readiness for regulatory and internal audit examinations. Timely respond to inquiries and ensure suitability and timely execution of corrective action plans.

  • Build and run training and awareness programs to educate and alert staff, third parties, and clients to key risks and the behaviors and actions required to mitigate risks.

  • Build strong and effective relationships with key staff and support initiatives to advance information security capabilities.

  • Actively engage with industry associations and develop industry relationships. Stay abreast of evolving threats/risks.

  • Oversee the Enterprise Information Security Group’s projects and guide the projects to on-time and on-budget delivery. Ensure transparency of key project risks.

  • Serve as the owner of the information security policy and oversee the policy exception management process. Evolve policy and standards to account for new technologies, changing regulations, threats, and risks.

  • Contribute to the leadership team’s success by influencing decisions, leading, and supporting initiatives.

  • Conduct career planning with assigned staff.

  • Mentor staff members to ensure their goals align with BU/Domain goals and the staff members are growing

  • Execute projects in Agile (or at appropriate times Waterfall) methodologies.

  • Function as PM or Scrum Lead to ensure projects are delivered on time, on budget with the desired outcomes

  • Implement analytics to measure and ensure adoption, taking corrective action when required.

Education / Experience / Background

  • Bachelor’s degree in Computer Science, Information Systems or a related field required; Related Master’s degree, preferred
  • 10+ years’ experience in a production IT environment managing enterprise IT infrastructure, hardware, hosting service and network areas.

  • 8+ years designing and building a conforming cyber security posture that aligns with the Group’s mission and strategy

  • 5+ years of leadership experience, with a focus on cybersecurity

 

Knowledge / Skills / Abilities

 

  • Experience managing and architecting components of cyber-secure positions

  • Experience with cyber remediation and reporting

  • Ability to understand business drivers in order to organize and prioritize multiple competing deadlines and assign resources accordingly

  • Strong communication, analytical and problem-solving skills with the ability to drive actionable changes

 

Required Certification / Licenses / Training

 

 

Physical Requirements / Work Environment

  • Prolonged periods sitting at a desk and working on a computer or tablet
  • Travel 35% of the time (domestic and international)

BENEFITS

Veolia’s comprehensive benefits package includes paid time off policies, as well as health, dental and vision insurance. In addition, employees are also entitled to participate in an employer sponsored 401(k) plan, to save for retirement.  Pay and benefits for employees represented by a union are outlined in their collective bargaining agreement.

 

A subsidiary of Veolia group, Veolia North America (VNA) offers a full spectrum of water, waste and energy management services, including water and wastewater treatment, commercial and hazardous waste collection and disposal, energy consulting and resource recovery. VNA helps commercial, industrial, healthcare, higher education and municipality customers throughout North America. Headquartered in Boston, Mass., Veolia North America has more than 10,000 employees working at more than 350 locations across the continent. www.veolianorthamerica.com

We are an Equal Opportunity Employer! All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.

Disclaimer: The salary, other compensation, and benefits information is accurate as of the date of this posting. The Company reserves the right to modify this information at any time, subject to applicable law.



Source link

Protected by Security by CleanTalk